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Glossary 

Facade enclosure envelope Throughout this entry, 
Facade is a subset of Enclosure and Enclosure is 
considered the same as Envelope. 


Carbon neutral net-zero and the 2030 challenge 

Carbon-neutral buildings and net-zero buildings 
are typically defined as buildings designed to 
achieve net-zero in green house gas emissions dur¬ 
ing their lifetime. This has been set as a requirement 
for all new buildings by 2030 in the 2030 
Challenge™ (url). 

Passive survivability Environmental Building News 
uses the term passive survivability to describe a 
building’s ability to maintain critical life-support 
conditions in the event of extended loss of power, 
heating fuel, or water, or in the event of extraordi¬ 
nary heat spells. 

Biophilia Introduced by E.O. Wilson, the biophilia 
hypothesis suggests that there is an instinctive 
bond between human beings and other living sys¬ 
tems that must be met by buildings that ensure 
critical connections. 

Ambient and task lighting Ambient lighting is the 
light needed for safe movement and non-visual 
tasks, while task lighting is linked to visual acuity 
at task and must be glare-free. Daylight can be used 
for both, but requires significantly greater design 
resolution for task lighting. 

Mixed-mode or Hybrid HVAC An approach to space 
conditioning that combines natural ventilation 
from operable windows or vents with mechanical 
heating, cooling, and ventilation systems (HVAC). 

Night ventilation An approach to space conditioning 
that utilizes cool night air to precool the buildings’ 
thermal mass in order to reduce daytime cooling 
demands. 

Thermal bridge A conductive or non-insulated ele¬ 
ment in the facade assembly that allows heat to be 
unnecessarily lost or gained, and allows condensa¬ 
tion to occur, to be addressed by well-designed 
thermal breaks. 

PassivHaus™ A standard that codifies energy conser¬ 
vation and passive conditioning requirements for 
residential and small commercial buildings. 


Robert A. Meyers (ed.), Encyclopedia of Sustainability Science and Technology, DOI 10.1007/978-1-4419-0851-3, 
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Double envelope The introduction of two facades 
with a captured air space and static or dynamic 
components that support orientation- and season- 
specific modulation of climate conditions. 
Rainscreen An independent layer on the facade to 
manage weather challenges, separated by vent 
spaces from the structural, thermal, and vapor 
management of the primary facade. 

Definition of the Subject 

Sustainable buildings have enclosures that ensure the 
highest level of thermal, visual, acoustic, and air qual¬ 
ity, as well as design for change, through system flexi¬ 
bility, and for long-term building integrity (Fig. 1). 
Sustainable enclosure design clearly demonstrates 
advances in enclosure components and assembly 
design, as well as their effective integration with inte¬ 
rior systems, structural systems, mechanical condition¬ 
ing systems, lighting systems, telecommunications and 
power systems. Sustainable enclosures effectively 
address ten major responsibilities, indeed areas of 
major innovative opportunity, in the pursuit of aes¬ 
thetics, comfort, and resource conservation: 

1. Access to the natural environment 

2. Daylighting 


3. Natural ventilation 

4. Heat loss/heat gain control 

5. Solar control 

6. Load balancing 

7. Passive and active solar 

8. Water 

9. Enclosure life 

10. Systems integration 

In addition to performance goals for building roofs 
and exposed floors, the design of high-performance 
building enclosures must fully resolve at least 12 facade 
layers: the transom, viewing field, kick plate, and span¬ 
drel (top to bottom) and the interior, integral and 
exterior layers (inside to out) to resolve each of the 10 
performance goals (The relative importance of facade 
components and layers to each performance outcome 
will be highlighted in graphics with deepest shading for 
the most important design decisions.) 

Introduction 

Architecture Unplugged: Superefficient Buildings 
that Environmentally Surf 

Carbon-neutral buildings, net-zero buildings, and the 
2030 challenge. To meet these goals, the next generation 
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Diagram illustrating 12 areas in the facade critical to various performance outcomes 





































Facades and Enclosures, Building for Sustainability 


3677 



of buildings must be designed to use so little energy 
that on-site renewable energy can fully meet the 
heating, cooling, and lighting loads - loads that repre¬ 
sent over 35% of all US energy. 

For the first generation of projects, these challenges 
are being met by tight buildings, with superefficient 
equipment, fed in part by solar installations. While 
this will help ensure a 30% reduction, it will never 
achieve carbon neutrality. 

Carbon neutrality is critically dependent on the 
maximum use of nature’s renewable energies - day¬ 
light, natural ventilation, natural cooling, and passive 
solar heating. Building systems will need to be turned 
off as long as possible - ensuring buildings that “free- 
roll” or better yet “environmentally surf” through 
hours, days, months, and seasons with natural condi¬ 
tioning. The beauty of free-rolling buildings goes 
beyond energy and carbon, however, to support health, 
productivity, and a higher quality of life. 

The most critical early design decisions relate to 
building massing - the height, depth, and orientation 
of buildings - which has a major impact on indoor 
environmental quality and energy loads. Neither “the 
tallest building in the world” nor “the largest building 
under one roof” will offer sustainable work environ¬ 
ments in a power failure. Indeed, these buildings create 
significantly higher-energy loads in almost every cli¬ 
mate since they eliminate the use of daylight, natural 
ventilation, or the natural dissipation of internal heat 
gains through the building skin [ 1 ], and must be aban¬ 
doned in a power outage. If air quality, comfort, and 
energy are drivers for building form, then the next 
generation of buildings will strive toward controlling 
the building depth, height, and orientation to achieve 
environmental comfort for a maximum percentage of 
the year - unplugged. 

“Architecture Unplugged” would require enclo¬ 
sures that demonstrate serious attention to the man¬ 
agement of solar gain, heat transfer, moisture 
migration, and day-night load balancing - a form of 
environmental surfing that depends on dynamic 
facades that recognize time of day, season, and variabil¬ 
ity of climate. These mass, color, venting, and thermal 
insulation characteristics are key to energy conserva¬ 
tion in buildings, and require entirely regional solu¬ 
tions. Heavy capacitance masonry facades will be seen 
in desert climates with large diurnal swings; heavily 


shaded and highly operable facades will be seen in 
warm, humid climates; and solar exposed yet well- 
insulated facades will be seen in cold climates. In each 
climate, the massing, orientation, and selection of 
facade materials will be regional in character, providing 
the maximum natural comfort before mechanical sys¬ 
tems need to be introduced (Figs. 2 and 3). 

Facades that Maximize Individual Access to the 
Natural Environment 

The most significant step that designers, engineers, 
and building managers can take to move toward 
healthier and more sustainable buildings is the com¬ 
mitment to increasing individual contact with the 
outside environment. The emerging field of “bio- 
philia” has identified the importance of access to 
nature to human well-being [2-5], exploring a range 
of qualities of the natural environment: views, day¬ 
light, sunlight, fresh air, breezes, access to outdoor 
spaces and activities, circadian rhythms, seasonal and 
daily climate variations, and nature’s sounds, smells 
and habitat. Windows and doors connect building 
occupants with a richness that may be critical to indi¬ 
vidual health. At the same time, windows and doors 
provide those outside the building with a level of 
transparency, oversight, and contact that may be crit¬ 
ical to community. 

Europe and Scandinavia have guaranteed every 
worker seated access to a window with views, setting 
a maximum distance of 7 m, or 20 ft from the window 
wall. In Switzerland and Scandinavia, they have fur¬ 
ther guaranteed access to operable windows in each 
workplace [6]. With these standards, buildings typi¬ 
cally do not exceed 60 ft from window wall to window 
wall, and always maintain operable, dynamic facades 
instead of sealed, seasonally unchanging building 
enclosures. 

Driven by the advent of air conditioning and inex¬ 
pensive electricity, the “obese” buildings of today 
(maximum volume - minimum surface area buildings) 
have actually resulted in a dramatic increase in the 
length of the cooling “season” for buildings in all cli¬ 
mates. They have also resulted in a significant increase 
in lighting loads during daylight hours, in combination 
often contributing more than 50% of the total energy 
demand in buildings. The study of the Building 
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Facades and Enclosures, Building for Sustainability. Figure 2 

Deep energy conservation combined with passive conditioning including daylighting, natural ventilation, and passive 
solar can dramatically reduce energy consumption 
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Critical facade elements to maximize access to the natural environment 
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Research Establishment of a portfolio of UK office 
buildings clearly illustrates the energy penalty of sealed, 
deep floor plan “prestige” buildings [7]. 

While there is significant debate about the impor¬ 
tance of indoor daylight or sunshine for human health 
or performance, there seems to be a growing consensus 
that access to views is significant. Beginning with 
the seminal work of Ulrich [8], then Mendell [9], 
Heschong Mahone Group [10], and now Kellert [4], 
seated views of nature and proximity to windows 
are being linked to reduced length of stay after 
surgery, reduced sick building syndrome, increased 
performance at task, and overall improved emotional 
health. 

In a 1990 survey of over 2,000 employees in two 
buildings at the US Department of Energy, Carnegie 
Mellon University’s Center for Building Performance 
identified 10-20% lower sick building symptoms 
among employees with seated views of windows, 
controlling for rank [11]. Whether user perception 
of personal health is improved due to the light, the 
view, the perimeter conditioning systems, or an 
increased level of environmental control (blinds, 
HVAC controls) at the window is unclear. Regardless, 
there is a measurable benefit to ensuring that a 
workforce has fewer health symptoms across the 
board (Fig. 4). 

Two case studies that frame the conclusion that 
views are a significant factor in health and pro¬ 
ductivity are captured below from the Carnegie 
Mellon Building Investment Decision Support Tool 
(BIDS™) [7, 8, 10]: 


Window View of Nature = Health 


In a 1984 observational field study at a Pennsylvania 
hospital, Ulrich identifies an 8.5% reduction in 
postoperative hospital stay (8.7 days vs. 7.96 days) for 
gall bladder surgery patients who had a view of a 
natural scene from their hospital room, as compared 
to those with a view of a brick wall. Patients with 
a view of a natural scene also received fewer negative 
evaluations from nurses and took fewer strong 
analgesics (Pennsylvania Hospital/Ulrich [8]). 




Facades and Enclosures, Building for Sustainability. 
Figure 4 

Window proximity correlates to a 5-25% reduction in 
health complaints among 2,000 workers in two US office 
buildings, controlled for rank [11] 


Window View of Nature = Productivity 


In a 2003 building case study of the Sacramento 
Municipal Utility District (SMUD) Call Center, 
Heschong et al. identify a 6-7% faster Average Handling 
Time (AHT) for employees with seated access to views 
through larger windows with vegetation content from 
their cubicles, as compared to employees with no view 
of the outdoors (SMUD/Heschong Mahone Group [10]). 


These studies illustrate the value of windows with 
views for all occupied spaces, alongside effective design 
of window size and orientation, sight lines, and view 
content. The studies suggest that views of the outdoors 
should be with “biophilic” content - views of pedes¬ 
trians and trees and community life - to critically 
maintain our sense of time and season. At the same 
time, it will be imperative for designers to ensure that 
requirements for glare control, heat gain and noise 
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control, privacy and security are equally met—the def¬ 
inition of quality design (Fig. 5). 

In pursuing design for sustainability, the challenge 
to designers is not only to connect indoor spaces with 
the outdoors visually, but to physically connect build¬ 
ing occupants with nature - each sites unique climate 
and seasons, the textures, sounds, smells, and diversity 
of landscape and species. This not only suggests a shift 
away from mega-plexes and high-rise buildings toward 
open-air campus and village planning, but a commit¬ 
ment to operable windows, and distributed doors, ter¬ 
races, and landscapes. 

The physiological and psychological benefits of an 
ongoing connection to the locally unique and season¬ 
ally dynamic natural environment still need quantifi¬ 
cation. For example, studies in early childhood 
development reveal the academic and emotional bene¬ 
fits of natural playgrounds as opposed to hard-top play 
yards [12]. Sustainable buildings must incorporate 
physical access to outdoor spaces for a host of reasons, 
including physical exercise, exposure to sunshine and 
circadian rhythms, variations in thermal stimuli, and 
access to community (Fig. 6). 


Facades that Ensure Daylighting as the Dominant 
Source for Both Task and Ambient Lighting 

► "A building with lights on during the daytime is not 

a sustainable building." 

Electric lighting is almost 25% of our primary 
energy use and over 35% of all electric energy use in 
the USA [ 13] - and much of that is during the daytime! 
Energy-efficient lamps, ballasts, and fixtures are obvi¬ 
ous first steps for ensuring 30% lighting energy savings, 
and daylight responsive and occupancy responsive con¬ 
trols for these fixtures can reduce the next 20% (refer¬ 
ence: see [14]). The giant leap, however, is achieved by 
buildings designed for daylighting as the dominant 
light source—environmentally “surfing” for 90% of 
the day without electric demand for lighting. This is 
more than rediscovering the design expertise of the 
nineteenth century. Good daylighting design demands 
effective integration of new glazing technologies, intro¬ 
ducing light redirection and shading layers that enrich 
and regionalize architecture. Critically, buildings must 
be articulated, in finger or courtyard shapes, to ensure 
universal access to daylight. 



Facades and Enclosures, Building for Sustainability. Figure 5 

Designing window dimensions, orientations, and sight lines as well as adjacent landscaped spaces (with substantial storm 
water management benefits) are critical to ensuring access to views with "biophilic" content 
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Facades and Enclosures, Building for Sustainability. Figure 6 

Critical facade elements to ensure daylighting as the dominant light source 


There are two main alternatives in the use of daylight 
in work environments. The first is the use of daylight for 
ambient lighting requirements of 200-300 lx and for 
spatial highlights. The second is the use of daylight as 
the dominant task lighting source at the work surface. 
This second alternative offers the greatest energy savings 
but also the greatest design challenge, since both the 
daylight “fixture” and the electric lighting interface 
need to be effectively designed. The daylight fixture - 
window, transom, or skylight - will need appropriate 
orientation, size, reflector-frame design and lens-blind 
design as well as the corresponding design of the room to 
ensure effective light distribution without glare. 

There are a broad range of traditional and innovative 
technologies to be considered for effective daylighting of 
buildings to achieve both ambient and task lighting 
needs. New developments in glazing enable architects 
to specify high visual transmittance (short wave energy) 
while controlling solar transmittance and heat loss (long 
wave energy). Given innovations in glazing technology, 
it is possible to achieve high light transmittance over 
50% while maintaining shading coefficients below 40% 


in cooling load-dominant climates, or above 60% in 
heating load-dominant climates. 

However, sustainable design recognizes that not all 
shading should be solved with the glass selection, since 
this can rob spaces of daylight and solar heat where 
needed. 

Each window must be designed as a lighting 
“fixture” in relation to space configuration and layout. 
If lighting engineers can design effective reflectors and 
diffusers for point and linear sources, they can design 
for planar sources as well. Transom glazing will be 
critical to effective distribution, combined with exte¬ 
rior, integral, or interior light redirection devices to 
ensure a more even distribution of daylight and to 
reduce glare from the window. If there is a shortage of 
window access, it should never be given to circulation 
aisles in a sustainable design (Fig. 7). 

The rules for effective daylighting in offices, 
schools, and hospitals have long been established, relat¬ 
ing to percent of aperture, ceiling height, room depth 
and color, as well as glare control and light redirection 
devices at the window. Today, high-performance 






























3682 


Facades and Enclosures, Building for Sustainability 




Facades and Enclosures, Building for Sustainability. 
Figure 7 

Daylight is celebrated at Reagan National Airport, 
Washington DC 


enclosures go even further, fully exploring the benefits 
of a layered facade. The design of layered facades 
enables seasonally and daily dynamic control of the 
light, heat, and ventilation energies available in the 
natural environment. All three layers should be 
designed together - external shading and light shelves 
to distribute daylight and reduce high sun angle glare; 
integral high visible transmittance glazing with climate 
responsive or dynamic shading coefficients; and inter¬ 
nal light redirection blinds that sustain views while 
diminishing low sun angle glare. It is critical that the 
design of these layered facades be entirely regional in 
character, and suited to the function of the building, 
as they control light, glare, solar heat, heat loss/gain, 
even thermal mass. By displacing mechanical and elec¬ 
trical loads, these facades provide long-term energy 
savings. In retrofitting deep buildings for daylighting, 


the sustainable design community is exploring atria, 
light wells, light ducts, and light pipes. Bank of America 
in San Francisco cut an atrium into an overly deep 
building to bring daylight to all spaces. Innovations in 
light pipes, reflective ducts, and tracking/beaming 
heliostats all have potential for increasing daylight 
effectiveness and delight (Fig. 8). 

At the same time, the daylighting “system” must 
give the appropriate information to the electric light¬ 
ing interface so that it can offer the appropriate “infill” 
through continuous dimming control. Studies indi¬ 
cate that the selection, placement, and installation 
of the photo-sensor is critical, concluding that a 
ceiling-mounted photosensor shielded from direct 
window light will ensure the best correlation with 
daylight work-plane illuminance [15]. The net impact 
of daylighting and electric lighting integration is sig¬ 
nificant energy savings [16] and improved workplace 
satisfaction. 

Emerging research is revealing that day-lit offices, 
classrooms, and hospitals may measurably contribute 
to greater health and performance at task. Light levels 
can be higher without energy penalty; full spectrum 
light offers rich color rendition and 3-dimensional 
modeling; circadian rhythms set by daylight variations 
throughout the day trigger melatonin production 
and sleep patterns; and views meet fundamental 
human needs. 

Sustainable buildings are designed for daylight to 
meet all ambient and task requirements during the day. 
The investment in a layered facade for daylight effec¬ 
tiveness will simultaneously support natural ventila¬ 
tion, night ventilation, thermal load balancing, and 
passive solar heating - the full complement of regional 
opportunities for natural conditioning. 


Daylight Lighting Control = Energy Savings 


In a 1997 before and after building case study of the 
New York City federal building, the Electric Power 
Research Institute (EPRI) identifies 64% lighting energy 
savings and net HVAC energy savings of 0.07 kWh per 
square foot annually following the installation of an 
energy-efficient lighting system with daylight dimming 
and lumen maintenance controls (NYC Federal 
Building/ EPRI [16]). 
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Facades and Enclosures, Building for Sustainability. Figure 8 

The central atrium and "sky gardens" at Commerzbank in Frankfurt bring daylight to all spaces and provide occupants with 
access to nature in a dense urban environment 


Daylight Spectrum and Timing = Individual 
Productivity 


In a 1997 controlled experiment, Boyce et al. identify 
a 1.6-12.8% improvement in night-shift workers' 
performance on short-term memory and logical 
reasoning tasks under large skylight-simulating fixtures 
with hidden fluorescent lamps, capable of providing 
fixed or variable illuminance from 200 lx to 2800 lx. 
Performance was enhanced by fixed high illuminance 
of 2800 lx and by steadily decreasing illuminance that 
simulated daylight from midday to dusk, as compared 
to fixed low illuminance of 250 lx or steadily increasing 
illuminance that simulated daylight from dawn to 
midday (Boyce et al. [17]). 


Facades that Ensure Natural Ventilation as the 
Dominant Fresh Air and Cooling Source 
Combined with Mixed-Mode HVAC Conditioning 

► "A sealed building is not a sustainable building." 

The use of natural ventilation and natural cooling 
with direct outside air for swing and winter seasons 
ensure affordable high levels of fresh air with the 
greatest energy efficiency (Fig. 9). International studies 
consistently link increases in outdoor air supply to both 
productivity and health gains in the workplace [9]. 


Natural Ventilation = Health 


In a 1991 multiple building study of 880 workers 
in eleven office buildings in the San Francisco bay 
area, Mendell identified 20-81% reductions in reported 
SBS symptoms for occupants of naturally ventilated 
buildings as compared to occupants of office buildings 
with air conditioning or mechanical ventilation 
(CA Healthy Building Study/ Mendell [9]). 


Natural Ventilation = Health + Individual 
Productivity 


In a 2004 multiple building study of professional 
middle-aged women in France, Preziosi et al. identify a 
57.1% reduction in absenteeism, a 16.7% reduction in 
medical services use (doctor visits), and a 34.8% 
reduction in hospital stays among subjects with natural 
ventilation in their workplace, as compared to those 
with air conditioning (Preziosi et al. [18]). 

In addition, the substantial opportunity for “free 
cooling” through the use of outside air is significantly 
underestimated by calculations focused on only “com¬ 
fort zone” natural ventilation. Natural cooling is pos¬ 
sible whenever outside conditions are within or below 
the comfort zone both in the day and at night, as long 
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Facades and Enclosures, Building for Sustainability. Figure 9 

Critical facade elements to ensure natural ventilation as the dominant fresh air and cooling source 


as drafts can be effectively managed through mixing or 
heat recovery. The use of both passive (open windows) 
and active economizer cycles can save from 50% to 
100% of the cooling demand in various regions of the 
USA, when including the benefits of nighttime cooling 
of the building’s thermal mass [19]. 


Natural Ventilation = Individual Productivity + 
Energy Savings 


In a 2002 multiple building study of 39 Australian office 
settings, Rowe identifies a 13.8% perceived productivity 
improvement, a 15% improvement in overall comfort, 
and 79% annual HVAC energy savings due to replacing 
a mechanical air-conditioning system with mixed-mode 
HVAC that supports both natural ventilation and air 
conditioning (Australian Offices/ Rowe [20]). 

Design for effective natural ventilation and natural 
cooling of the building mass and its occupants requires 
multi-disciplinary expertise to ensure that building 
shape, orientation, atria/courtyards, chimneys, window 


types (awning, casement, double hung), sizes, and loca¬ 
tions ensure effective cross and stack ventilation. Cross 
and stack ventilation demands that building form and 
space layout reflect prevailing wind directions and the 
potential for thermal and solar assist, often studied 
through wind model testing or computational fluid 
dynamic simulations. In climates with day-night tem¬ 
perature swings, night ventilation should be explored 
and building materials or storage selected to store 
cooler temperature, creating a thermal “flywheel” for 
the following day. 

In many climates, “mixed-mode” HVAC systems are 
designed to support natural ventilation and to provide 
air conditioning and ventilation - when and where 
natural ventilation is inadequate. There are three 
types of mixed-mode HVAC systems [7]: 

• Concurrent systems, which use natural ventilation 
and mechanical HVAC simultaneously. Occupants 
are free to open windows and the HVAC system 
provides supplemental ventilation, dehumidifica¬ 
tion, and cooling, while an advanced control system 
coordinates zone air supply rates with window 
positions. 
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• Changeover systems, where the building alternates 
between natural and mechanical mode on 
a seasonal or daily basis. 

• Zoned systems, in which different conditioning 
strategies are used simultaneously in different 
zones of a building. 

Concerns related to outdoor air pollution, pollen, 
and humidity can be effectively addressed with well- 
designed mixed-mode conditioning systems (Fig. 10). 
Natural ventilation in high-rise buildings demands 
both architectural and engineering innovations to 
manage both pressurization issues and stack effect. 
This has been accomplished through the design of 
stacked 6-8-story buildings such as the Commerzbank 
in Frankfurt, and through double envelope designs, 
where wind and pressure are moderated through inter¬ 
mediate facade spaces. 

For both natural ventilation and mechanically 
assisted ventilation, it is critical to engineer ventilation 
effectiveness , ensuring adequate fresh air volumes 
through large apertures or economizers; accessible, 


large volume fresh air paths from air-intake locations 
to occupants, such as in displacement ventilation sys¬ 
tems; multiple fresh air sources with operable windows, 
distributed air handlers or facade integrated HVAC; 
and adequate exhaust without recirculation or contam¬ 
ination of the air flow paths. Air intakes especially 
should be strategically located, not near loading docks 
with idling engines or on roof decks with standing 
water, but in non-polluted areas, without standing 
water or solar overheating. Indeed, air sources should 
be freshest where prevailing breezes travel through 
shaded, landscaped areas with some level of natural 
air purification and cooling. 

To achieve effective natural ventilation with mixed¬ 
mode mechanical conditioning, the design team must 
work together to engineer and integrate the natural 
ventilation/natural cooling controls and the HVAC 
controls to maximize air quality, user control, passive 
conditioning, and energy efficiency. Given a growing 
body of research illustrated below, natural ventilation 
with mixed-mode conditioning should be the norm in 




Facades and Enclosures, Building for Sustainability. Figure 10 

Operable windows at Commerzbank, a high-rise structure, provide natural conditioning for the occupants, linked with 
mechanical cooling in a hybrid system. The atrium at Institute for Forestry and Nature Research in Wageningen 
provides a protected space for natural ventilation in the offices while giving occupants a natural setting for meetings 
and relaxation 
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all commercial buildings for human health and pro¬ 
ductivity, energy efficiency, and the viability of occu¬ 
pied spaces in the face of electricity shortages. 

Facades that Minimize Enclosure Heat 
Loss/Heat Gain 

One of the earliest decisions affecting the sustainability 
of buildings is the overall massing of the building and 
the orientation of major facade and roof surfaces 
(Fig. 11). While very harsh climates might benefit 
from minimum surface area to usable space enclosed, 
there are climate-specific arguments for increasing 
building enclosure for daylighting, natural ventilation, 
passive solar heating, and even the dissipation of exces¬ 
sive internal heat gains (see Load Balancing). These 
increased exposures must be strategic, however, 
avoiding solar overheating, cold air infiltration, and 
increased heat loss or heat gain. Serious re-evaluation 
of vast roof exposures and cantilevered floors is needed, 
minimizing excessive heat losses or heat gains unless 
they have corresponding benefits from natural condi¬ 
tioning. Even the location of the HVAC system will be 
significant, with rooftop and west facing cooling 


equipment measurably less efficient due to solar heat 
gains. Design goals should be set for building shape and 
orientation to ensure the best integration of passive 
conditioning while reducing environmental loads. 

Architects make major decisions about the thermal 
performance of buildings through the specification of 
window to wall ratios. Sustainable designers weigh heat 
loss and heat gain against daylighting, natural ventila¬ 
tion, passive solar heating, and load balancing through 
the facade, alongside the spatial flexibility needed by 
dynamic organizations. Innovative building enclosures 
are designed to act like the human skin, dynamically 
supporting heat, air, and moisture dissipation for the 
health of the occupants in changing climates. The sus¬ 
tainable design community has rediscovered the 
opportunities for dynamic thermal performance of 
building enclosures, with high-resistance louvers, cur¬ 
tains, and dynamic layered facades changing heat trans¬ 
fer characteristics hour by hour, day to night, or season 
by season. 

Needless to say, the highest level of thermal resistance 
to heat loss and heat gain is pursued for both the wall 
and window assemblies in sustainable buildings. Inno¬ 
vations in wall insulation have supported shifts from 
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Critical facade elements to minimize enclosure heat loss/heat gain 
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R5 per inch to RIO per inch - a doubling of perfor¬ 
mance specifications. Window innovations also 
abound, with R2 double glazed assemblies replaced by 
R4, R8, and even R12 assemblies as a result of low-e, gas 
filled assemblies with innovative spacer and frame tech¬ 
nologies, even while visible transmission is critically 
kept above 50%. At the same time, low solar heat gain 
coefficients (SHGC) of 0.35-0.50 can be achieved 
where needed to reduce overheating at the facade with¬ 
out loss of visual connection to the outside. At a very 
minimum, these three window variables - resistance to 
heat transfer (R), visible transmission (t v ), and solar 
heat gain coefficients (SHGC, solar transmission) - 
must be balanced in response to each climate and 
building type. 


Facade Temperature Control = Individual 
Productivity 


In a 2000 field case study of telecommunication office 
workers in Finland, Hannula et al. identify a 2.8% 
increase in measured productivity in north-facing 
offices with an average temperature of 23.6°C (range 
21.9-27.8°C) as compared to south-facing offices with 
an average temperature of 25.2°C (range 22.8-28°C), 
supporting the need for improved heat gain control 
by orientation (Telecommunication Office/ Hannula 
et al. [21]). 

Finally, sustainable enclosure design detailing 
ensures the lowest level of thermal bridging through 
walls, roofs, floors, and foundations. Structural and 
enclosure designers must work together to ensure the 
elimination or minimization of structural elements 
that connect indoor heated or cooled areas with the 
outdoors, including the facade, roof, floor, and foun¬ 
dation. Ideally the thermal “wrapper” is continuous 
and fully outside the structural frame, except at foun¬ 
dation connections that might indeed support ground 
source conditioning (see Thermal Load Balancing). 
Cantilevered balconies, overhangs, floors, and roof ele¬ 
ments can contribute to significant heat loss, moisture 
and vapor migration, and condensation that lead to 
building degradation (see Enclosure Life). Not only the 
structural integration, but also product specification, 
enclosure detailing, and design for constructability are 


critical to the thermal performance of facades. There 
are numerous conditions where well-insulated facades, 
roofs, and foundations are compromised by penetra¬ 
tions that were not anticipated in the drawings (vents, 
tie rods, utility meters, etc.), or not adequately 
sequenced in construction to ensure the continuity of 
the thermal envelope. 

One of the most exciting new developments in 
enclosure design is green roofs and green walls. Green 
roofs and their layers of waterproofing, insulation, 
soils, and vegetation have the potential to improve 
roof resistance to heat gain by 50% (NRC reference - 
see [22]), improve roof longevity by a factor of 2 [23], 
reduce storm water runoff by 50% [24], reduce air 
plane noise transmission, as well as reduce urban heat 
islands and air pollution [25, 26]. Green walls are also 
offering measurable gains by reducing solar heat gain, 
providing evaporative cooling, and creating seasonal 
qualities for our buildings (Fig. 12). 


Green Walls + Double Skin Facade = Energy 
Savings 


In a 2005 controlled lab experiment in the Netherlands, 
Stec et al. identify an 18% reduction in cooling system 
capacity requirements and a 19% reduction in cooling 
energy consumption for a double skin facade with 
plants in the cavity rather than blinds, due to a 20% 
reduction in temperature on the interior wall of the 
facade (Stec et al. [27]). 

Facades that Integrate Climate-Responsive 
Shading and Glare Control 

As with heat loss and heat gain, managing solar 
overheating and glare begins with the careful sizing 
and location of windows to ensure shade and light 
without glare (Fig. 13). East and West orientations are 
the most problematic, since sun angles are too low to 
effectively block with overhangs or open blinds. Sky¬ 
lights in horizontal roofs can also be problematic since 
they receive excessive solar heat in summer and signif¬ 
icantly less winter energy (when solar energy is most 
beneficial). Window sizes and locations should be 
uniquely considered for each building face, as well as 
the detailing of internal and external controls. 
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Facades and Enclosures, Building for Sustainability. Figure 12 

Green walls at Musee Branly reduce heat gain in summer 



Facades and Enclosures, Building for Sustainability. Figure 13 

Critical facade elements for climate-responsive shading and glare control 


Integral shading devices , the selection of reflective or 
low transmission glazing, or the introduction of “frit” 
patterns to reduce solar transmission, are the most 
common strategies for shading and glare control. 


Unfortunately, low solar transmission glass will block 
useful gain in winter as well, and can also result in low 
daylight transmission. Many high reflectivity glass 
specifications inhibit daylighting, passive solar heating, 
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and clear views of the landscape. However, new devel¬ 
opments in glazing support high visible transmittance 
(daylight and view) with low solar transmittance (solar 
heat) for buildings in hot climates. Just as low-e glazing 
is becoming the norm for improved thermal resistance 
in glazing assemblies, this high-visible, low-solar glaz¬ 
ing must become the norm for all buildings where 
cooling loads dominate on the perimeter. 

A number of new glazing materials are emerging 
that support seasonally dynamic management of solar 
heat and light. Electrochromic, prismatic, and holo¬ 
graphic glazing assemblies are in development that 
would enable the seasonal or hourly separation of 
light and heat spectrums, and the depth and color of 
the beamed light, so that building spaces can be natu¬ 
rally conditioned with the least amount of solar 
overheating or glare. The combination of these heat 
and light redirection devices with exterior and interior 
shading devices enable the appropriate percentages of 
the solar light and heat to be independently tapped. 

Exterior shading devices , overhangs, light shelves, 
blinds, fins, and shades are the most effective means of 
reducing solar load while maintaining visual connection 
with the landscape. Clearly, the shading devices must be 
designed for each orientation, and for each site’s cli¬ 
mate and latitude. At least six types of exterior devices 
can be explored: an overhang/light shelf, recessed win¬ 
dows that act as an “egg-crate” shading device, dynamic 
shades or awnings, exterior blinds, exterior green walls, 
and a second dynamic facade (Fig. 14). 


Just as a baseball cap allows a clear view of the action 
without glare in the eye, overhangs or light shelves can 
allow for views without glare and solar overheating. 
While an overhang provides shade from high sun 
angles, it could act as a light shelf and reflect sunlight 
deeper into the space when positioned below the upper 
(transom) portion of the window wall. Fully retractable 
exterior blinds can support both seasonal shading and 
daylighting goals, and enable views to be fully enjoyed 
during those hours when the sun is away from the 
facade. Awnings offer seasonally and daily adjustable 
shading on all orientations, full views when the summer 
sun has passed, and a colorful gaiety to the building. 
Fandscape materials are re-emerging as effective shad¬ 
ing for roofs, walls, windows, and terraces, self¬ 
regulating their density of shade to match the seasons, 
and offering potential to manage storm water as well. 


Green Walls = Energy Savings 


In a 2000 controlled experiment at Tanz Greenhouse at 
the University of Toronto and a follow-up computer 
simulation study, Bass and Baskaran identify a 23% 
reduction in cooling load and a 20% reduction in the 
fan energy use, for an 8% reduction in total annual 
energy consumption, when a green wall was used to 
shade an exterior surface of the building, as compared 
to an unshaded surface (Tanz Greenhouse/Bass and 
Baskaran [28]). 




Facades and Enclosures, Building for Sustainability. Figure 14 

Exterior shading devices at the Chesapeake Bay Foundation Headquarters are designed to account for different building 
orientations 
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The line of last defense, interior shading devices are 
somewhat less effective than exterior shading at mini¬ 
mizing solar load (heat gain from the sun), since the 
energy is able to enter the building before it can be 
reflected. While roll-down mesh shades provide a level 
of solar heat and glare control, inverted horizontal 
Venetian blinds provide heat and glare control as well 
as daylight contribution for sustainable buildings. 
Research at the Lawrence Berkeley National Laborato¬ 
ries demonstrates that exterior operable shading 
devices, and even interior shading devices, are more 
cost effective and energy/resource effective than highly 
tinted (low visible transmission) glass in all regions of 
the USA , from Los Angeles to Chicago to Miami [29]. 
The selection of interior blinds should maximize light 
reflection, typically with a white or metallic color; 
maximize light distribution in the room, typically 
through inverted blinds and carefully shaped blinds 
that act as effective light shelves; and provide adjustable 
slat angles to maximize view potential while also 
maximizing work-plane illuminance [30] (Fig. 15). 

If they are combined with exterior shades in 
a multi-layered facade, interior shades and blinds can 
be dedicated to light diffusion and glare management, 
with fabric/color selection designed to minimize 
brightness contrast between the window and its sur¬ 
roundings. Coupled with operable exterior shading 
devices, interior blinds could also be selected to absorb 
solar heat for perimeter heat gain in winter months. As 
innovations in phase change materials emerge, it is 


possible to imagine internal shutters or shades that 
absorb solar heat during the day, and then are reversed 
at night to radiate solar heat to the occupied space. 

The most far reaching facade development in the 
past 10 years has been the development of the double 
envelope. When a second facade is introduced 
1-3 m from the window wall, the intermediate space 
can be used to moderate the environment - heat, light, 
wind, noise, pollution, and other environmental 
stressors. These second facades house layers of shading, 
light redirection, air redirection, thermal load balancing, 
mechanical conditioning, and even dynamic levels of 
resistance to heat loss and gain. The research on the life 
cycle benefits of these innovations reveals significant 
variations in both costs and performance [3 1 ]. If double 
envelopes provide the greatest number of hours, days, 
and months without mechanical conditioning, they 
will be invaluable to carbon-neutral designers. The 
following section explores a number of enclosure solu¬ 
tions that use dimensioned “circulatory systems,” such 
as double envelope designs, to reduce environmental 
loads in buildings (Fig. 16). 

Facades that Engineer Load Balancing and Mean 
Radiant Temperature Control 

To work toward healthier buildings with higher levels 
of local/individual comfort, it is imperative that the 
thermal imbalances, mean radiant imbalances , and 
conductive losses in buildings be addressed (Fig. 17). 



Facades and Enclosures, Building for Sustainability. Figure 15 

The dynamic light redirection louvers in the Robert L. Preger Intelligent Workplace can be configured to increase 
daylight levels in the interior spaces while reducing glare at the perimeter. Interior Venetian blinds have engineered 
w-sections that manage low and high angle sun differently, reducing direct glare while maximizing view and daylighting 
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Facades and Enclosures, Building for Sustainability. Figure 16 

Light shelves in the double envelope combined with external overhangs provide wonderful daylight and views for the 
new Cambridge public library 


Transom 


Viewing Field 


Kick Plate 


Spandrel 



Facades and Enclosures, Building for Sustainability. Figure 17 

Critical facade elements for load balancing and mean radiant temperature control 
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The engineering team is equally responsible with the 
architect for ensuring that the building enclosure 
detailing fully resolves thermal bridges and radiant 
imbalances. 

As the internal heat gains in buildings have 
increased with office automation, the disparity in tem¬ 
perature conditions between facades and interior 
spaces has grown. Surprisingly, some designers have 
removed or reduced perimeter conditioning on the 
assumption that the interior loads will compensate 
for perimeter losses. This convective exchange will not 
be effective, however, without specifying super- 
insulated facade assemblies and/or deliberate load 
balancing by mechanically moving “core” heated air 
or water through the window wall. 

Both air flow windows and water flow mullion 
systems enable excess heat from the building core - 
heat from occupancy, lights, and equipment - to be 
effectively dissipated through the facade. By taking 
return air through the “glass duct” of a triple glazed 
air-flow window, core cooling loads are dramatically 
reduced and perimeter heating is almost eliminated 
through core-to-perimeter load balancing. Water mul- 
lions (thermally broken from the outdoors) can also 
use “waste” heat from the core to minimize loads. In 


this system, waste heat from building cooling or power 
generating systems can be circulated through the facade 
to eliminate perimeter heating requirements and radi¬ 
ant imbalance, while allowing an increase in building 
periphery for views and light at every workstation. 
Indeed, the building facade could be seen as the natural 
dissipater of energy, a “circulatory system” resembling 
that of a healthy human - with appropriate surface to 
volume ratios. Given the constant increases in internal 
loads today, there is a real justification for increasing 
the periphery of buildings with the facade designed as 
an integral part of the mechanical system. 

In addition to core and perimeter load differences, 
there are also significant load imbalances at different 
facade orientations, typically driven by solar load dif¬ 
ferentials (Fig. 18). One facade mechanical system of 
note is the use of double envelopes to provide north- 
south-east-west load balancing in climates where solar 
loads are significant and beneficial. The Occidental 
building in Niagara Falls is one of the earliest examples 
of a double envelope construction. When solar energy 
is received on the east, a natural convective loop of 
solar-heated air wraps the entire building. This con¬ 
tinues throughout the day to eliminate simultaneous 
heating and cooling, to maximize passive solar 



Facades and Enclosures, Building for Sustainability. Figure 18 

In addition to radiant ceiling panels, radiant water mullions are integrated into the facade to balance temperatures year 
round in the Intelligent Workplace using water for energy-efficient load balancing 
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contributions to the heating load, and to ensure excel¬ 
lent mean radiant conditions. In summer, the double 
envelope is vented to the outside, and precautions are 
taken for fire protection. The Occidental building uses 
70% less heating and cooling energy than a conven¬ 
tional office building in upstate New York [32]. 

A third strategy for reducing or eliminating the need 
for mechanical cooling is the use of day-night load 
balancing strategies. Night ventilation of a buildings 
structural mass can successfully reduce or eliminate 
cooling loads in cooler climates, or climates with day- 
night temperature swings. Combined with the careful 
monitoring of dew point temperatures to avoid 
overcooling and condensation, nighttime cooling of 
a building’s mass can provide adequate capacitance to 
absorb an entire day of internal gains. Thermal mass 
can be achieved through “heavy” or high-capacitance 
building walls and floors, or through the strategic 
introduction of phase change materials (PCM) to 
absorb and radiate heat as designed. A corollary to 
nighttime ventilation is the use of off-peak nighttime 
electricity to generate chilled water and ice or to chill 
PCMs for daytime cooling. This “ice storage” cooling 
strategy can be effectively bundled with nighttime ven¬ 
tilation of a building’s thermal mass to absorb daytime 
overheating loads without peak electricity demands. 
Phase change materials also offer the potential for stor¬ 
ing daytime solar heat for nighttime use, a strategy 
integrated into a number of PassivHaus projects [33]. 

For opaque enclosure surfaces, the value of high 
reflectivity surfaces in all overheated climates has led 
to a new era of “cool roof” design [34]. The combina¬ 
tion of high reflectivity and night venting of enclosure 
mass in climates with day-night temperature swings 
has the greatest potential for natural comfort and low 
energy cooling [35]. 


tile roofing; and 3.2% annual cooling energy savings 
and a 17.2% peak cooling load reduction from white 
asphalt shingles, as compared to dark asphalt shingles, 
on a single-family house (Florida Sustainable Energy 
Center/Parker et al. [35]). 


The final load-balancing strategy for reducing 
heating and cooling loads in buildings is to take advan¬ 
tage of ground and aquifer temperatures through 
ground-coupling. Ten feet below ground, and in under¬ 
ground aquifers, the temperatures are surprisingly sta¬ 
ble at year-round (inter-seasonal) averages of 
approximately 12-18°C (55-65°F). Given excellent 
temperatures for radiant, water and air-based cooling, 
high-performance building designers have tapped into 
this natural cooling potential through earth sheltered 
construction, large diameter “cool tubes,” and aquifer 
or ground sourced chilled water loops that serve radi¬ 
ant, fan coil, or heat pump systems. Ground coupling 
significantly impacts foundation and flooring design, 
and might impact the facade “circulatory” systems as 
well (Fig. 19). 

The benefits of active (mechanically integrated) 
load balancing to resolve mean radiant imbalances - 
from core to perimeter, from N/S to E/W, from day to 
night, and inter-seasonally through ground coupling - 
are many and are as follows: improved individual com¬ 
fort, productivity, and health [36, 37]; reduced build¬ 
ing material degradation from thermal differentials and 
condensation; significant energy conservation; and the 
potential to replace mechanical heating or cooling first 
costs and operating costs with high-performance 
building enclosures [38] (Fig. 20). 



Cool Roof = Energy Savings + Peak Demand 
Reduction + Extended Roof Life 


In a 2002 multiple building controlled experiment in 
Florida, Parker et al. of the Florida Solar Energy Center 
identify 17.8-24.9% annual cooling energy savings and 
a 28.5-35.5% peak cooling load reduction from highly 
reflective white metal, white barrel tile, and white flat 
tile roofing; 9.5% annual cooling energy savings and 
a 12.9% peak cooling load reduction from terra cotta 


Radiant Cooling = Individual Productivity + Energy 
Savings 


In a 1999 controlled field experiment and simulation 
study, Takehito et al. identify a measured 22% increase 
in speed and a 1.5% improvement in accuracy on simple 
tasks among women subjects and a simulated 10% 
HVAC energy savings in the Tokyo climate from a radiant 
ceiling cooling system as compared to conventional 
ceiling-based air conditioning (Imanari et al. [36]). 
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Facades and Enclosures, Building for Sustainability. Figure 19 

"Cool tubes" beneath the greenhouses at Phipps Conservatory provide free cooling 


Night Ventilation Cooling = Individual 
Productivity 


In a 2003 meta-analysis study, Seppanen et al. identify 
a productivity increase equivalent to 0.39 h of work per 
day (4.9% for an 8-h workday) due to nighttime venti¬ 
lation cooling of thermal mass, a very energy-efficient 
method of reducing daytime indoor temperatures by 
using nighttime air to cool a building's structure and 
furnishings (Seppanen et al. [37]). 


Geothermal Heat Pump = Energy Savings + First 
Cost Savings 


In a 1998 field study of Maxey Elementary School in 
Lincoln, Nebraska, Shonder et al. identify an average 
47% reduction in annual HVAC energy use with a geo¬ 
thermal heat pump system as compared to the alterna¬ 
tives of VAV systems with air- or water-cooled chillers, as 
well as a first cost savings of $1.80/sf. Geothermal heat 
pump systems were found to have the lowest first cost, 
operational cost, and life cycle emissions of the three 
systems (Maxey Elementary School/Schonder et al. [38]). 


Facades that Integrate Passive and Active Solar 
Heating, Cooling, and Power 

The most direct use of solar energy in buildings is 
for passive solar heating (Fig. 21). The design of build¬ 
ing enclosures to support direct gain through high 
solar transmission glass can effectively offset over 
50% of the heat loss of buildings in many climates. 
Indeed, it is critical that glazing be carefully selected 
to ensure high solar transmission in cold climates 
to enable passive solar heating. In locations where 
direct gain would cause excessive daytime overheating 
or glare, the design of indirect gain passive solar 
facades - glazed mass walls or Trombe walls [39], 
water walls, or phase change walls - can strategically 
collect solar heat for nighttime heating. Recent devel¬ 
opments in phase change materials that can be 
embedded in common construction or finish mate¬ 
rials will help ensure that adequate solar heat storage 
exists to shift daytime solar gain to nighttime 
loads. Each of these solar heating strategies can be 
fully passive, utilizing no mechanical energy to col¬ 
lect, store, or distribute solar heat while providing 
potential health benefits to building occupants [40] 
(Fig. 22). 
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Sunlight = Health 


In a 2005 building case study of Inha University Hospital 
in Korea, Choi identified a 41% reduction (3.2 days) in 
average length of stay among gynecology patients in 
brightly daylit rooms (317 lx average), as compared to 
those in dull rooms, in the spring, and an average 26% 
reduction (1.9 days) in average length of stay among 
surgery ward patients in bright rooms, as compared to 
those in dull rooms, in the fall. Across all seasons, the 
average daylight illuminance in bright rooms was 317 
lx, compared to 190 lx in dull rooms (Inha University 
Hospital/ Choi [40]). 


The solar heat gained in building atria or double 
envelope facades can also be actively transferred to 
a range of storage media, from rock storage to phase 
change storage to underfloor air systems with thermal 
mass. Active solar hot water collectors integrated into 
building roofs or facades can also transfer solar heat to 
architectural or mechanical heat exchangers. The effec¬ 
tive integration of solar heat gain with the ventilation 
air systems in buildings can significantly reduce heating 
loads while supporting greater ventilation rates for 
human health. 

While solar thermal collectors can provide excellent 
hot water temperatures for heating, concentrating solar 
thermal collectors can generate the extreme high tem¬ 
peratures needed for cooling and power generation. 
Bundled with the latest in absorption chillers and 
steam power generators, a new generation of solar- 
evacuated tube and concentrating collectors are begin¬ 
ning to transform building enclosures to help meet the 
growing electrical demands in buildings for air condi¬ 
tioning (Fig. 23). 

Finally, integrating photovoltaic solar (PV) elements 
into the building curtain wall or roof can generate on¬ 
site power for feeding into the grid or for immediate 
use in the building. For maximum efficiency, PV energy 
could be used directly as DC electricity (without trans¬ 
former loses to AC) to meet the power demands of fans 
and air conditioners, plug loads, and dynamic facade 
elements - from operable windows to light shelves 
and shading devices. Recent thin film developments 
have significantly improved the performance of 
building integrated photovoltaics (BIPV), supporting 



Facades and Enclosures, Building for Sustainability. 
Figure 20 

Architecturally articulated thermal mass at Gaudi's Sagrada 
Familia minimizes daily temperature swings by night 
ventilation through architecturally expressive openings 


a range of design choices regarding shape, orientation, 
transparency, and color. For windows, new PV glazing 
assemblies can provide effective shading, daylight 
distribution, and a competitive power source. The 
next generation of building windows may absorb 
solar energy in almost transparent coatings and carry 
the electricity to window mullions for transport to 
the grid. For roofs and walls, the early PV assemblies 
that were rigidly sandwiched behind glass have been 
replaced by newer flexible PV materials that can replace 
common building materials, with 3-10-year life- 
cycle justifications. The vast areas of roof and facade 
that absorb or reflect long hours of sunshine can be 
turned into an asset even in hot climates, with building 
integrated photovoltaics playing a major role in gener¬ 
ating distributed power, eliminating the source and 
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Facades and Enclosures, Building for Sustainability. Figure 21 

Critical facade elements for passive and active solar heating, cooling, and power 



Facades and Enclosures, Building for Sustainability. Figure 22 

Interior and exterior views of the south facade at Fraunhofer Institute in Freiburg illustrate the integration of active and 
passive solar strategies 


distribution inefficiencies of conventional power pro¬ 
duction for greatly reduced carbon impacts. 

Wind power generation can also be integrated with 
building facades and roofs. A new generation of micro¬ 


turbines may create the parapets of future buildings, 
and large wind farms may occupy the upper floors of 
skyscrapers, as proposed in early solutions for the 
World Trade Center by Battle Engineering [41]. 
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Facades and Enclosures, Building for Sustainability. Figure 23 

Concentrating solar thermal collectors on the roof of the Intelligent Workplace produce steam in the summer to power 
absorption chillers to cool the space 


Photovoltaic System = Energy Savings 


In a building case study of Bren Hall, a LEED Platinum¬ 
rated building at the University of California, Santa 
Barbara (UCSB), Aigner identified that the building's 
47 kW rooftop photovoltaic system reduces purchased 
electricity consumption by 7-10% annually, at a first 
cost of $240,000, for an ROI of 5% (Bren Hall/ Aigner 
[42]). 

Facades Designed to Manage Water 

A fundamental role for high-performance building 
enclosures is the management of water. It is imperative 
that roofs and facades effectively manage rainfall 
through material selection and articulated detailing 
from top to bottom (Fig. 24). While gargoyles and 
expressive cornices that were once the solution 
to ensure heavy rains were shed away from the 
building, modern facades rely far too heavily on the 
caulked joint - often with poor results as the facades 
age (Fig. 25). 

A lesser understood role for building facades is 
to manage the vapor migration that might lead to 


condensation on embedded surfaces that are below 
dew point temperature. This is typically achieved 
through the design of continuous vapor barriers on 
the “warm side of the facade” to stop the humidity 
from migrating into the wall. Detailing in design 
and construction supervision is critical, especially in 
complex climates where humidity and cold tempera¬ 
tures may both be from the inside or outside. In 
addition, moisture that is trapped in construction 
materials must be released naturally as well, typically 
through a vapor porous skin or vented air spaces [43]. 
Rain screens are an innovative solution to managing 
both rain and vapor migration. Rain screens introduce 
an uncaulked and vented second skin for the facade - 
typically of ceramic tile or metal or wood panels - that 
sheds the rain and utilizes the vented air space to 
eliminate moisture buildup. 

As we embrace a greater set of sustainability 
challenges, building enclosures also need to collect 
rainwater today. As we face declining sources of 
fresh water, and increasing storm runoff challenges, 
the importance of designing building enclosures 
to catch and store water is of growing importance 
(Fig. 26). 
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Facades and Enclosures, Building for Sustainability. Figure 24 

Critical facade elements to manage rainwater and moisture migration 



Facades and Enclosures, Building for Sustainability. Figure 25 

On the left, water sculpture by Herbert Dreisetl playfully manages storm water, enticing visitors to return in the rain to 
watch. On the right, rainwater capture provides acoustic and visual richness in the Dockside Green development by Joe 
Van Bellingham 
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Green Roof = Energy Savings + Peak Load Reduction + 
Storm Water Retention 


In a 2002 building case study of the Chicago City Hall, 
the city of Chicago identified annual cooling energy 
savings of 0.02 kWh/square foot and annual heating 
energy savings of 0.02 therms/square foot, as well as 
a 70% reduction in storm water runoff, from installation 
of a 20,000 sf green roof (Chicago City Hall/ City of 
Chicago [44]). 


Green Roof = Energy Savings + Storm Water 
Retention 


In a 2003 building case study of the 901 Cherry Offices 
of Gap, Inc. in San Bruno, CA, Green Roofs for Healthy 
Cities identify a 100% reduction in storm water runoff 
(7.54 gal/sf of roof area) and a simple payback of 
11 years from energy cost savings alone (equivalent to 
7% annual energy savings in a baseline building) due to 
installation of an extensive green roof (901 Cherry 
(GAP)/ Green Roofs for Healthy Cities [45]). 


Facades that Maximize Enclosure Life 

High-performance building enclosures will need to use 
materials sustainably (Fig. 27). Wherever possible, 
existing enclosures should be preserved to protect 
their “embodied” energy, or built of 100% recycled 
content materials. If virgin materials are used, their 
“chain of custody” should be known in relation to 
energy, carbon, rare materials, and toxicity. Toxic mate¬ 
rials should be avoided in all assemblies, or designed for 
retrieval during disassembly. 

Enclosure assemblies should be designed for their 
longevity, for natural weathering and effective main¬ 
tainability. Most critically, enclosures should be 
designed and built for cherish-ability, the desire on the 
part of owners and the community to want to preserve 
the building rather than tear it down. This is typically 
ensured by the use of quality materials, assembled for 
longevity, and a high level of craftsmanship (Fig. 28). 

Enclosure design must also address the needs for 
human safety in the face of manmade and natural 
disasters. This includes structural integrity in the face 
of earthquakes, hurricanes, and cyclones as well mate¬ 
rial integrity in the face of fires, floods, and blasts. 




Facades and Enclosures, Building for Sustainability. Figure 26 

Green roofs at the California Academy of Sciences capture storm water while providing a learning environment for 
landscape and habitats 
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Facades and Enclosures, Building for Sustainability. Figure 27 

Critical facade elements to maximize enclosure life 


Design for disaster protection should be about human 
safety not building protection, ensuring adequate 
egress and adequate safe harbors. Design for human 
safety must also ensure adequate access to water, 
breathing air, and light, even in the face of power out¬ 
ages - a redundancy that can be best achieved by 
sustainable facade design that fully incorporates natu¬ 
ral conditioning. 

At the same time, building enclosures need to be 
designed for change - for expansion or contraction, for 
changes in use or function. Change can be supported 
through building generosity, floor-to-floor heights, 
size of openings, and level of building articulation. 
Change can be supported by modularity, with a kit of 
parts that anticipates the evolution of scale and func¬ 
tion over the life of the building. Change can also be 
accommodated by design for disassembly , by which the 
building and/or its enclosure can be relocated to 
a second or third life. All high-performance enclosure 
components, such as window and door assemblies, 
water management systems, and ornamental parapets, 
should be designed to be recovered as the “value- 
added” assemblies that they are well beyond the value 
of the raw materials (Fig. 29). 


Construction Waste Management = Salvage/Waste 
Savings 


In a 2004 building case study of the renovation of the 
Boston Scientific Company, Inc. office building, 
the Institutional Recycling Network identified a 32% 
reduction in disposal costs due to a comprehensive 
waste management process, compared to the cost of 
renovation with conventional landfill disposal of mate¬ 
rial waste. A site separation process allowed materials to 
be disposed of at lower costs than co-mingled materials 
and resulted in a 92% recycling rate by weight (Boston 
Scientific Company/ Institution Recycling Network [46]). 


Construction Waste Management = Salvage/Waste 
Savings 


In a 2000-2001 building case study of the demolition of 
the 133,000 square foot Hume Residence Hall on the 
campus of the University of Florida, Guy and Strong 
identify new revenues, reduced disposal costs, 
and cost avoidance from substitutions of reclaimed 
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Historical structures were designed and built to last a lifetime 
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Forty percent of the total waste stream can be attributed to the construction industry 
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materials, achieved through a process of salvage and 
source separation of "debris" building materials. 
A comprehensive C&D debris management process 
resulted in a net savings of 4% compared to the cost 
of conventional demolition with extensive landfill dis¬ 
posal (University of Florida Hume Residence Hall/Guy 
and Strong [47]). 


Future Directions: Facade Innovation Through 
Systems Integration 

In times of limited resources and limited building budgets, 
some of the most innovative high-performance facades 
are achieved through systems integration (Fig. 30). At a 
very minimum, high-performance sustainable building 
facades will have to be collaboratively designed to: 

• Integrate structure and enclosure to ensure heat 
loss/heat gain control, provide shading and glare 
control, and support daylighting. 

• Integrate fire and enclosure to support load 
balancing innovations. 


• Integrate HVAC and enclosure to support natural 
ventilation, load balancing, and passive and active 
solar energy utilization. 

• Integrate interior systems and enclosure systems for 
access to nature, daylighting, natural ventilation, 
shade and glare control, and passive solar heating. 

• Ensure design for enclosure life and design for 
change without waste. 

When expert disciplines work collaboratively early in 
the design process, a range of innovations are generated - 
structural systems that offer shading and light redirec¬ 
tion; enclosures that are integral with the mechanical 
duct or piping; fire egress that provides shade and access 
for maintenance; ornament that manages water; and 
expressions of function, culture, or ambition that serve 
a critical purpose for ensuring the highest level of indoor 
environmental quality and environmental sustainability. 
Sustainable building enclosures are the deliverables of 
integrated design teams and integrated building delivery 
processes that in turn are pivotal to achieving the highest 
level of health, productivity, energy, and environmental 
benefits (Fig. 31). 



Facades and Enclosures, Building for Sustainability. Figure 30 

Critical facade elements for innovation through systems integration 
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Whole Building = Energy Savings 


In a 2000 building case study of BRE's Environmental 
office building in Garston, UK, Fisher et al. identify a 66% 
annual energy savings compared to a typical type III UK 
office building due to the use of thermal mass, natural 
ventilation, passive and low energy cooling, daylight¬ 
ing, high-performance electric lighting, and on-site 
power generation (BRE Environmental Building/Fisher 
et al. [48]). 


Floor-Based Ventilation + Increased Outside Air = 
Health 


In a 2000 multiple building study of 39 schools in 
Sweden, Smedje and Norback identify a 69% reduction 
in the 2-year incidence of asthma among students in 
schools that received a new displacement ventilation 
system with increased fresh air supply rates and dedi¬ 
cated exhaust, as compared to students in schools that 
did not receive a new ventilation system (Elementary 
School/Smedje and Norback [49]). 


High-Performance Building = Energy Savings + Rent 
Premium 


In a 2004 building case study of the Phillips Eco- 
Enterprise Center in Minneapolis, Brinkema identifies 
$0.31 /sf energy savings and a 5-10% lease premium 
for the building owner (a financial gain of $0.62/sf) 
due to high-performance building design that incorpo¬ 
rates passive solar and geothermal heating and cooling, 
daylighting and high efficiency electric lighting, 
salvaged building materials, nontoxic paints and 
finishes and landscaping with native plants (Philips 
Eco-Enterprise Center/ Brinkema [50]). 


High-Performance Building = Energy Savings + 
Individual Productivity 


In a 1992 building case study of ING Bank in Amsterdam, 
Bill Browning of Rocky Mountain Institute identifies a 
92% reduction in primary energy consumption and a 
15% reduction in employee absenteeism compared 




Facades and Enclosures, Building for Sustainability. Figure 31 

The proposed "Building As Power Plant" at Carnegie Mellon University attempts to integrate all building systems, 
assemblies, and components to export unneeded renewable energy while providing optimum comfort for the occupants 
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to the bank's former headquarters, due to high- 
performance design strategies including daylight, a nar¬ 
row floor plan that allows landscaped views for every 
occupant, passive solar conditioning, co-generation, and 
the use of heat exchangers (ING Bank/ Browning [51]). 
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Glossary 

Activity The concentration of a given radionuclide in 
a soil or sediment per unit mass, also referred to as 
the mass activity density. 

Erosion The mobilization of soil and sediment from 
its original location or a temporary store within the 
landscape by a range of agents, including water and 
wind. 
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Fallout radionuclides Radionuclides that enter the 
terrestrial and aquatic environments as fallout. 

Fluvial system Landscape systems associated with 
streams and rivers and driven by rainfall and runoff. 

Gamma spectrometry Measurement of the gamma- 
emitted radioactivity in soils and sediment, using 
a gamma spectrometer. 

Gross erosion The total amount of soil mobilized by 
erosion within a given area. 

Half-life The time taken for a radionuclide to decay 
to the point where its activity is half the original 
activity. 

Inventory An alternative term for the areal activity 
density, which represents the total radionuclide 
content of a soil or sediment deposit per unit sur¬ 
face area. 

Net erosion The total amount of eroded sediment 
transported out of a given area. This represents 
the gross erosion within that area minus the depo¬ 
sition occurring within that area. 

Off-site impacts The impacts of soil erosion associ¬ 
ated with the transport of eroded soil away from the 
original point of erosion, for example, by streams 
and rivers. 

On-site impacts The impacts of soil erosion associ¬ 
ated with the point where erosion occurs and 
including reduction in soil depth, loss of produc¬ 
tivity, and destruction of agricultural land. 

Sediment budget A conceptual model of the sources, 
transfers, sinks, and outputs of sediment for 
a catchment or river basin. The model may quantify 
mobilization, transfer, storage, and output. 

Sediment delivery ratio This represents the ratio of 
the amount of sediment leaving a field or catch¬ 
ment to the total amount of sediment mobilized by 
erosion within that area. It provides a measure of 
the efficiency of sediment transfer out of the system 
considered. The sediment delivery ratio will 
decrease as deposition increases. 

Sediment Material transported or deposited within 
fluvial, aeolian, and other erosional systems. 

Sediment source fingerprinting Use of a range of 
biogeochemical properties to characterize or finger¬ 
print different potential sediment sources. The 
properties of transported or deposited sediment 
are compared with those fingerprints, in order to 
determine its source. 


Sediment source The point, place, or area from which 
sediment was mobilized. 

Sedimentation A term frequently used to refer to the 
study of sediment transport in fluvial systems, 
including sediment mobilization, transport, and 
deposition, but also used in a more narrow sense 
to refer to deposition of sediment in sinks including 
river channels and floodplains, lakes and reservoirs, 
and estuaries and coastal environments. 

Definition of the Subject 

Soil erosion is an important problem in many areas of 
the world, where it reduces soil productivity, destroys 
agricultural land, and can threaten food security and 
the sustainable management of soil resources. The on¬ 
site effects of soil erosion are coupled with off-site 
effects, which relate to the downstream or downwind 
impact of the eroded soil. The resulting transport of 
increased amounts of fine sediment, particularly in 
streams and rivers, and the deposition of this sediment 
in downstream sinks gives rise to a range of important 
environmental problems. These include the degrada¬ 
tion of water quality and aquatic habitats. Concern for 
the sustainable management of both soil resources and 
catchments and river basins has directed attention to 
the need to develop effective erosion and sediment 
control strategies. Designing and implementing such 
strategies requires an improved understanding and 
quantification of the mobilization, transfer, and depo¬ 
sition of sediment within the landscape, so that miti¬ 
gation measures can be targeted to optimize their 
effectiveness. Existing measurement techniques for 
assembling data on erosion and sedimentation within 
the landscape face many limitations, and the potential 
for using fallout radionuclides to assemble the infor¬ 
mation required has attracted increasing attention over 
the past 30 years. Fallout radionuclides offer unique 
opportunities to document sediment mobilization and 
deposition rates and to trace the movement of fine 
sediment through the landscape. This entry reviews 
the important needs for information on erosion and 
sedimentation and the history of the application of 
fallout radionuclides in this field, describes and 
explains the basis for using fallout radionuclides in 
erosion and sedimentation investigations, and provides 
several different examples of their successful use. 
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Introduction 

Soil erosion represents the removal of soil from the 
location where it developed by erosion processes. 
These processes are driven by several agents, the most 
important of which are water and wind. Water erosion 
is the most widespread cause of soil loss and occurs in 
a wide range of environments. Wind erosion can be 
equally destructive, but its occurrence is more limited. 
Soil erosion can be viewed as a natural process, but 
rates of soil loss are commonly very low in natural 
undisturbed landscapes, where the soil will generally 
be well protected by vegetation from the destructive 
effects of rainsplash, surface runoff, and deflation. 
Under these conditions, the rate of soil loss is likely to 
be balanced by the rate of soil formation and the soil 
will be in a quasi-equilibrium. When, however, the 
vegetation cover is removed or reduced by human 
activities, such as land clearance and cultivation, exces¬ 
sive grazing or forestry activity, rates of soil loss can 
increase by several orders of magnitude. Such acceler¬ 
ated soil erosion can represent a serious threat to the 
sustainability of the soil resource and to food security, 
through both reduced crop productivity and the loss of 
agricultural land as a result of soil removal, gully devel¬ 
opment, and other destructive effects of erosion. 
Recent estimates indicate that ca. 1,000 Mha of the 
earth’s surface is currently adversely affected by water 
erosion and ca. 500 Mha by wind erosion, with about 
80% of these totals being associated with the develop¬ 
ing world [ 1 ]. Much of this erosion occurs on agricul¬ 
tural land. It has been estimated that about 80% of the 
world’s agricultural land suffers from moderate-to- 
severe erosion and that about 10% suffers from slight- 
to-moderate erosion [2], Even in developed countries, 
such as the USA, where soil conservation measures are 
widely applied, soil erosion represents a major prob¬ 
lem, and it has been reported that in the USA, about 
23 million hectares of cropland experience excessive 
soil erosion and that erosion rates exceed the tolerable 
soil loss on about a further 20 million hectares [3]. 

The annual global loss of agricultural land due to 
accelerated erosion has been estimated to be ca. 3 
million hectares [4] and the progressive destruction of 
the global soil resource by erosion is emphasized by 
the calculation that the world’s croplands are losing 
about 23 billion tons of soil in excess of new soil 


formation each year [5]. The rate of soil formation 
will be influenced by a wide range of factors, but 
a value of 1 t ha -1 year -1 is frequently cited as 
a typical value. This value is often referred to as the 
soil loss tolerance, since, if erosion rates exceed this 
value, the soil depth will progressively reduce and 
crop productivity is likely to decline. Global models 
have been used to assess the extent to which this value 
of 1 t ha -1 year -1 is exceeded and, based on the global 
pattern of land use in the 1980s, the global mean soil 
erosion has been estimated to be 10.2 t ha -1 year -1 , 
a value which is an order of magnitude greater [6]. 
Average erosion rates were shown to be highest 
in Asia (12.2 t ha -1 year -1 ) and lowest in Australia 
(3.0 t ha -1 year -1 ). 

Current concern for the impact of climate change 
and global warming has directed attention to likely 
changes in soil erosion rates. In most areas of the 
world, predictions suggest that soil erosion rates will 
increase and, in the global modeling exercise referred to 
above [6], climate change predictions associated with 
a doubling of C0 2 concentrations were coupled with 
erosion models to estimate that at the global scale, soil 
loss would increase by 9% by the 2090s as a result of 
climate change. When this prediction was coupled with 
likely changes in land use by the 2090s, soil erosion 
increased by 14%. Soils play a key role in the global 
carbon cycle by storing carbon. Accelerated erosion 
perturbs this cycle by mobilizing soil carbon that may 
be subsequently oxidized causing increased C0 2 emis¬ 
sions, and by redistributing soil carbon and sequester¬ 
ing it in longer-term depositional sinks [7, 8] . Both past 
and likely future increases in soil erosion therefore have 
important implications for the global carbon cycle. 

Although concern for soil erosion has traditionally 
focused on the on-site effects linked to loss of agricul¬ 
tural land and reduced productivity, it is important to 
recognize that much of the sediment mobilized by 
accelerated water erosion will find its way into streams 
and river systems leading to increased sediment loads. 
Increased sediment loads can give rise to a range 
of environmental problems linked to degradation of 
water quality and aquatic habitats, damage caused to 
infrastructure during floods, and increased deposition 
of sediment in river channels, lakes, and reservoirs [9]. 
These impacts are frequently referred to as off-site 
impacts, to distinguish them from the on-site effects 
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highlighted above. In many situations, the off-site 
impacts of soil erosion will be more serious than the 
on-site effects. Taken together, they represent a very 
large economic cost and it has been estimated that 
worldwide, the annual cost of both the on-site and 
off-site impacts of soil erosion on agricultural land 
amounts to about $400 billion [10]. However, it is 
difficult to quantify the economic cost of loss of biodi¬ 
versity, changes in the carbon cycle, and impacts on 
ecosystem services, and the real cost may therefore be 
substantially greater. 

The deposition of sediment within the landscape 
and in water bodies, such as rivers, lakes, reservoirs, 
and marshes, must be seen as an inevitable complement 
to the mobilization of sediment by erosion and such 
processes are frequently referred to as sedimentation. 
Most attention is generally directed to sedimentation 
associated with water erosion, since sediment mobi¬ 
lized by such erosion can be deposited before reaching 
the stream network and, once entering the river system, 
there will be many further opportunities for sedimen¬ 
tation downstream. A substantial proportion of the 
sediment moving through the river system will reach 
the outlet of the river basin and may then be deposited 
in estuaries, in coastal areas and in the deeper ocean. In 
contrast, soil mobilized by wind erosion is usually 
deposited locally, although the finer fractions can be 
transported over substantial distances and it may be 
less easy to link erosion and sedimentation. Equally, 
such sediment is likely to be deposited over large areas 
and its impact is likely to be much less. Again, sedi¬ 
mentation must be seen as a natural process, but accel¬ 
erated erosion will frequently be coupled with 
increased sedimentation, which can lead to environ¬ 
mental problems. These problems relate not only to 
the physical presence of deposited sediment, which can, 
for example, greatly reduce the water holding or storage 
capacity of a water body, but also to the nutrients 
and contaminants, such as pesticides, associated with 
the sediment. Release of nutrients and contaminants 
from sediment deposited in water bodies can cause 
eutrophication and pollution of those water bodies. 
Similarly, deposition of contaminated sediment or sed¬ 
iment with a high-nutrient content on river floodplains 
can cause contamination of soil and crops and changes 
in the species composition of native vegetation com¬ 
munities, in response to the changing nutrient status of 


the soil. Estimates suggest that the world’s large reser¬ 
voirs are currently losing about 1% of their total stor¬ 
age each year [11]. Such losses can have important 
implications for water supply and the annual cost of 
replacing this lost storage has been estimated to be ca. 
$6 billion. When this cost is combined with the prob¬ 
lems of finding new dam sites, when existing reservoirs 
are filled with sediment and can no longer operate 
effectively, the serious consequences of reservoir sedi¬ 
mentation are clear. 

Against this background, there is an increasing need 
to assemble information on soil erosion and sedimen¬ 
tation rates to inform and support the sustainable 
management of soils, catchments, and associated 
water bodies. Such demands have increased with the 
development and use of numerical models as manage¬ 
ment tools, since such models commonly require data 
for calibration and validation. Although a range of 
techniques have been developed for documenting soil 
erosion and sedimentation rates [12, 13], these tech¬ 
niques have important limitations. For example, in the 
case of soil erosion, erosion plots are widely used to 
obtain information on rates of soil loss under particu¬ 
lar conditions, (e.g., different soil types, slope gradi¬ 
ents, crop types, cultivation practices, etc.). However, 
such plots are costly to install, they need to be operated 
for an extended period to obtain reliable information 
on mean annual rates of soil loss due to the inherent 
temporal variability of erosional events, and the results 
obtained from a small bounded plot may be 
unrepresentative of the natural landscape with its 
more complex topographic form. In the latter context, 
plot measurements relate to the export of material 
across the lower boundary of the plot and thus only 
provide spatially averaged estimates of net soil loss. 
They cannot provide information on deposition of 
mobilized soil within the plot and thus soil redistribu¬ 
tion rates more generally (i.e., gross erosion, deposi¬ 
tion, and net erosion). Furthermore, soil redistribution 
rates may vary markedly according to location within 
a field in relation to the local topography (e.g., ridges 
and hollows and concave and convex slope profiles). To 
meet the requirements of the current generation of 
distributed numerical models, spatially distributed 
information on erosion and deposition rates will fre¬ 
quently be required. Similar problems are faced by 
attempts to document deposition rates on river 
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floodplains and in water bodies, such as lakes, reser¬ 
voirs, and wetlands. Short-term variation of sediment 
deposition rates and spatial variability of deposition 
rates may introduce the need to quantify such 
variability. 

The limitations of traditional measurement tech¬ 
niques referred to above have directed attention to the 
potential for using environmental radionuclides to 
provide information on soil erosion and sedimentation 
rates. The fallout radionuclides cesium-137 ( 137 Cs), 
unsupported or excess lead-210 ( 210 Pb ex ), and beryl¬ 
lium-7 ( 7 Be) have proved particularly useful in this 
context, because they are rapidly and strongly fixed 
on reaching the soil or sediment surface as fallout and 
their post-fallout redistribution provides a means of 
documenting soil and sediment redistribution within 
the landscape. In essence, measurements of the radio¬ 
nuclide areal activity density or inventory (Bq m -2 ) are 
made at a number of points across a field by collecting 
soil cores. By comparing the results of these measure¬ 
ments with a measurement of the reference inventory, 
which represents the inventory found in an adjacent 
undisturbed area, areas or points experiencing erosion 
and deposition can be identified. In the former case, the 
inventory at the sampling point will be less than the 
reference inventory, and in the latter case, the inventory 
will exceed the reference inventory. The degree to which 
the inventory departs from the reference inventory is 
a direct reflection of the magnitude of the erosion or 
deposition rate, and a range of conversion models have 
been developed to estimate erosion and deposition 
rates from the measurements obtained. In a similar 
way, fallout inputs to water bodies, which are subse¬ 
quently incorporated into sediment deposits can pro¬ 
vide a means of establishing the chronology of 
sediment deposits. In the case of 137 Cs, the occurrence 
of peak fallout in 1963 provides a time marker for that 
year and the rate of decrease of 210 Pb ex activity with 
depth affords a means of estimating the time elapsed 
since the deposition of sediment from a specific depth 
and thus the sedimentation rate. 

Early work on the use of fallout radionuclides to 
document erosion and sedimentation processes 
focused on 137 Cs and dates from the aftermath of the 
bomb testing of the 1950s and early 1960s. Cesium-137 
can provide information relating to the redistribution 
of fallout received in the late 1950s and the 1960s, 


which currently provides a timeframe of ca. 50 years. 
The use of 137 Cs has been complemented more recently 
by the use of 210 Pb ex and 7 Be, which provide informa¬ 
tion relating to longer (i.e., ca. 100 years) and much 
shorter (i.e., the past few weeks) timeframes, respec¬ 
tively [14]. Fallout radionuclides are being increasingly 
used in soil erosion and sedimentation investigations 
in many different areas of the world, and they are able 
to provide essentially unique information on rates and 
spatial patterns of erosion and sedimentation that can¬ 
not be obtained using traditional approaches. Particular 
advantages in terms of documenting erosion and sedi¬ 
mentation and particularly soil erosion include the 
ability to: 

• Provide retrospective information on average rates 
of soil redistribution or sediment deposition 
(e.g., for the past 50 or 100 years) on the basis of 
contemporary sampling often involving only 
a single site visit. 

• Provide both time-integrated data relating to the 
past 50 or 100 years ( 137 Cs and 210 Pb ex ) and data 
relating to individual events or short periods of 
heavy rainfall occurring at present ( 7 Be). 

• Integrate the various processes contributing to the 
redistribution of soil and sediment particles in 
the landscape. 

• Obtain information from the “natural” landscape 
without the need for special installations or signif¬ 
icant disturbance of the study area. 

• Avoid the need for costly long-term monitoring. 

• Assemble spatially distributed information that is 
directly compatible with recent advances in the 
development of physically based distributed models 
and the application of GIS and geostatistics to 
modeling erosion and sedimentation. 

• Provide an approach that is applicable to several 
different components of the system, including ero¬ 
sion, sediment delivery, and sediment storage and 
deposition, as well as to tracing sediment sources. 
In the case of water erosion, it is also able to inte¬ 
grate processes associated with sheet erosion, rill 
erosion, and tillage erosion. 

• Provide an approach that is applicable throughout 
the world, since fallout radionuclide fluxes and 
inventories are of sufficient magnitude to permit 
their use in most areas of the globe. 
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This entry provides an overview of the basis for 
using fallout radionuclides in the study of erosion and 
sedimentation and of the information that can be 
obtained. It provides a brief history of the approach; 
a description of the key characteristics of the individual 
fallout radionuclides, including their source, fallout 
input, and post-fallout behavior; a discussion of their 
use to document soil erosion and soil redistribution, as 
well as sediment deposition; a brief review of field and 
laboratory techniques and key issues that need to be 
addressed; and some examples of applications, based 
on the work of the author over the past 30 years. 

A Brief History of the Use of Fallout Radionuclides 
in Erosion and Sedimentation Studies 

As indicated above, work on the use of fallout radio¬ 
nuclides in soil erosion and sedimentation investiga¬ 
tions largely dates from the aftermath of the nuclear 
weapons testing that occurred in the 1950s and early 
1960s, although early work on the use of 210 Pb ex to 
establish the chronology of sediment cores [15] devel¬ 
oped essentially independently of this. The nuclear 
weapons testing that occurred in the 1950s and 1960s 
caused the fallout of a range of radionuclides, including 
strontium-90 ( 90 Sr) and 137 Cs. Menzelwas arguably the 
first to recognize the link between radionuclide loss and 
soil loss from plots [16], although his work focused on 
90 Sr. Subsequent studies [17-19] highlighted the link 
between 137 Cs loss and soil loss, and McHenry and 
Ritchie demonstrated that the spatial pattern of 137 Cs 
inventories within a field could be used to identify areas 
of erosion and deposition [20]. Based on this early 
work, the potential for using 137 Cs to document soil 
erosion and sediment redistribution rates has been 
successfully exploited in many different areas of the 
world. In addition to the very important lead provided 
by Ritchie working in the USA, particular reference can 
be made to pioneering work undertaken in Canada 
[21, 22], Australia [23-25], and the UK [26, 27]. The 
Chernobyl accident in 1986 also provided a significant 
input to the expansion of work on the application of 
137 Cs measurements, by directing renewed attention to 
the behavior and fate of radiocesium in the environ¬ 
ment and providing new opportunities to exploit the 
approach [28]. A further important stimulus to work 
in the field was provided by the International Atomic 


Energy Agency (IAEA). This body sponsored two 
Coordinated Research Projects between 1995 and 
2001 aimed at promoting and standardizing proce¬ 
dures for using 137 Cs measurements in soil erosion 
and sedimentation investigations. These two Coordi¬ 
nated Research Projects culminated in the publication 
of a handbook for the assessment of soil erosion and 
sedimentation using environmental radionuclides that 
focused on the use of 137 Cs [29]. 

A clear and useful demonstration of the near expo¬ 
nential growth of studies involving the application of 
137 Cs in soil erosion and sedimentation investigations 
between the late 1960s and the mid 1990s is provided 
by the number of entries for specific years included in 
the bibliography of publications of 137 Cs studies related 
to erosion and sediment deposition, compiled by 
Ritchie and Ritchie [30]. Although some of the publi¬ 
cations included in this bibliography relate to broader 
aspects of the behavior of 137 Cs in the environment and 
to related radionuclides, the trend demonstrated in 
Fig. 1 provides a graphic indication of the rapid expan¬ 
sion of work in this field. 

Although they focused attention on 137 Cs, the two 
IAEA Coordinated Research Projects referred to above 
also aimed to promote the use of other fallout radio¬ 
nuclides, and more particularly 7 Be and 210 Pb ex , in soil 
erosion investigations. The development of studies 
using these two fallout radionuclides commenced in 
the early and mid-1990s and important early work 
includes that of Wallbrink and Murray [31], Blake 
et al. [32], Walling et al. [33], Matisoff et al. [34], and 
Wilson et al. [35] with 7 Be, and Wallbrink and Murray 
[36], Walling and coworkers [37-39] and Matisoff et al. 
[34] with 210 Pb ex . Important drivers in promoting this 
work with additional fallout radionuclides included the 
possibility of addressing different timescales, with 
210 Pb ex covering up to ca. 100 years and 7 Be providing 
information on individual events and short periods of 
heavy rainfall. Another driver was the opportunity to 
increase the potential for using fallout radionuclides in 
areas of the world where 137 Cs inventories are very low 
and where these inventories will decline further in the 
near future, due to radioactive decay and the lack of 
contemporary fallout inputs. 

Application of 137 Cs measurements to the study of 
sedimentation rates in depositional environments and 
to establishing the recent chronology for sediment 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 1 

The growth in the annual number of publications reporting work involving 137 Cs listed in the bibliography produced by 
Ritchie and Ritchie [30] 


cores from lakes and reservoirs can again be traced back 
to the immediate aftermath of the bomb tests of the late 
1950s and early 1960s [40]. The potential for using the 
peak fallout input in 1963 (or 1964 in the southern 
hemisphere) as a time marker was rapidly exploited by 
many investigators, and the approach was applied to 
a range of depositional sinks, in addition to lakes and 
reservoirs, including wetlands [41, 42], estuaries and 
coastal saltmarshes [43, 44], and river floodplains 
[45, 46]. The further fallout input provided by the 
Chernobyl accident introduced a second time marker 
in many areas of Europe and adjacent areas, which has 
improved the temporal resolution that can be achieved. 
Work by limnologists and sedimentologists on the 
application of 210 Pb ex in establishing the chronology 
of sediment cores over the past ca. 100 years again 
developed in the early 1970s [47, 48] but proceeded 
essentially independently of work with 137 Cs. This in 
part reflects the different principle involved, namely, 
the reduction in activity with depth in response to 
decay, which was not dependent on the timing of 
weapons testing. The first application of the approach 
was to the accumulation of ice sheets and was described 
by Goldberg in the early 1960s [15]. Since the 1970s, 


key developments have included the use of direct mea¬ 
surements of 210 Pb ex using gamma spectrometry as 
an alternative to its measurement using alpha spec¬ 
trometry, which is considerably more demanding in 
terms of the chemical extraction required, and the 
development of improved models for interpreting 
the 210 Pb ex depth distribution and establishing the 
chronology [49-51]. Because of its short half-life 
(53 days), 7 Be is commonly limited to the thin upper 
layer of sediment cores and is of little value for dating or 
establishing sedimentation rates. However, this radio¬ 
nuclide has been used since the 1980s to distinguish 
very recent sediment from older sediment and to quan¬ 
tify sediment mixing [52] and to estimate depths of 
overbank sedimentation on river floodplains associ¬ 
ated with individual flood events [53]. 

Fallout Sources and Inputs 

As indicated above, this contribution focuses on the 
application of three fallout radionuclides, namely, 
137 Cs, 210 Pb ex , and 7 Be, in soil erosion and sedimenta¬ 
tion investigations. These three radionuclides differ in 
terms of their half-lives, their sources, their fallout 
inputs, and the temporal variation of the resulting soil 
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inventories, and it is these differences that mean that 
they are able to be used to provide different, but com¬ 
plementary, information. It is important that their 
sources and the key features of their fallout input should 
be understood, since they have important implications 
for the interpretation of the measurements used to 
derive estimates of erosion and sedimentation rates. 
A brief overview of the source and fallout input of the 
three radionuclides is therefore provided below. 

Cesium-137 

Cesium-137 (t V2 30.1 years) differs from the other two 
radionuclides in that whereas they are of natural origin, 
this radionuclide is manmade. It is found in the global 
environment primarily as a result of the testing of 
thermonuclear weapons (bomb tests) that took place 
during the period extending from the early 1950s to the 
early 1960s. In addition, a further fallout input 
occurred in parts of Europe and adjacent regions as 
a result of the Chernobyl accident in 1986. The bomb 
tests injected 137 Cs into the stratosphere, where it 
mixed and circulated globally before being redeposited 
as fallout. The first high-yield bomb test took place in 
1952, but 137 Cs fallout was below detection in most 
areas of the world prior to 1954. The subsequent tem¬ 
poral pattern of fallout reflected the magnitude and 
number of bomb tests and the residence time of the 



a Year 


137 Cs in the stratosphere (ca. 1-10 years). Figure 2 
summarizes available data on the temporal distribution 
of the annual fallout resulting from the bomb tests in 
the 1950s and 1960s, by combining data for New York, 
USA, and Milford Haven, UK to provide a typical 
record for the northern hemisphere, and from Adelaide 
and Brisbane to provide a typical record for the south¬ 
ern hemisphere. 137 Cs was not routinely monitored 
during the main period of bomb fallout, but reliable 
estimates can be derived from the measurements of 90 Sr 
fallout made at this time, since the two radionuclides 
are found in a fixed ratio. Most bomb tests were located 
in the northern hemisphere and Fig. 2 shows a gradual 
increase in 137 Cs fallout in the northern hemisphere up 
to 1959, when the temporary moratorium on bomb 
tests imposed in 1958 caused a reduction in fallout, 
followed by a further increase in fallout up to 1963 
when the Nuclear Test Ban Treaty resulted in 
a cessation of bomb tests and fallout began to decline. 
Fallout levels fell below detection by the early 1980s. 
The further fallout input that occurred in 1986, as 
a result of the Chernobyl accident, was localized and 
more short-lived, because the radiocesium plume 
released from the accident was only dispersed into the 
troposphere. The data presented in Fig. 2 are estimates 
of annual fallout input. During a given year, the pattern 
of fallout is likely to have reflected that of precipitation, 
because most of the fallout occurred as wet fallout. 



Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 2 

Typical 137 Cs fallout records for the northern (New York, USA/Milford Haven, UK) and southern (Adelaide/Brisbane, 
Australia) hemispheres 
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The bomb tests injected 137 Cs into the stratosphere, 
and it was globally distributed. The global pattern of 
fallout reflects the global pattern of precipitation, since 
most fallout occurs as wet fallout in association with 
precipitation, the location of the bomb tests, and the 
pattern of stratospheric mixing. Because most tests 
occurred in the northern hemisphere, the global pat¬ 
tern of fallout evidences a clear latitudinal zonation. 
Figure 3 presents a generalized map of the global pat¬ 
tern of 137 Cs fallout associated with the bomb tests, as 
represented by the inventories found in 2000. Maxi¬ 
mum fallout occurred in the mid-latitudes of the 
northern hemisphere and minimum fallout occurred 
around the equator. Overall, the amount of fallout 
received in the southern hemisphere was only about 
30% of that received in the northern hemisphere. This 
feature of the global fallout pattern introduces some 
limitations on use of 137 Cs in the southern hemisphere, 
since the considerably lower inventories can introduce 
some measurement constraints. 

It has been estimated that the amount of 137 Cs 
released by the Chernobyl disaster was of the order of 
15% of that produced by the bomb tests. Because of its 
more limited tropospheric transport and dispersal, 


significant Chernobyl fallout is found over a more lim¬ 
ited area of the globe located around the Chernobyl 
source (see Fig. 4). Here the pattern reflects the trajec¬ 
tory of the plume of contaminant released from Cher¬ 
nobyl and the distribution of precipitation during the 
immediate post-accident period. Considerable local 
variability has been reported in the case of Chernobyl 
fallout and this partly reflects the fact that the fallout 
was associated with a small number of rainfall events 
occurring within a short period. This contrasts with the 
situation for bomb fallout, which was distributed over 
a ca. 20-year period during which local variability is 
likely to have been smoothed. Reports of the magni¬ 
tude of the depositional fluxes that occurred in the 
immediate vicinity of Chernobyl suggest that these 
were as high as 1,480 kBq m -2 and therefore more 
than two orders of magnitude greater than the total 
fallout received in areas of high bomb fallout. In many 
areas affected by Chernobyl fallout, that fallout input 
will now dominate the total inventory and this must be 
taken into account when using 137 Cs measurements to 
document soil erosion and sedimentation rates. How¬ 
ever, there are also many areas in Europe where the two 
fallout inputs are of a similar magnitude and this may 




Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 3 

The global pattern of 137 Cs inventories (Bq m -2 ) in 2000 (based on Walling and He [54]) 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 4 

The pattern of 137 Cs fallout over Europe resulting from the Chernobyl disaster (Based on Smith and Beresford [55]) 


introduce problems when interpreting contemporary 
measurements of 137 Cs inventories. In the period 
immediately after the Chernobyl accident, it was pos¬ 
sible to distinguish the two components of the total 
137 Cs inventory by also measuring the cesium-134 
( 134 Cs) activity. 134 Cs (ty 2 2.06 years) occurred in 
a fixed ratio to 137 Cs in Chernobyl fallout and this 
provided a means of quantifying the proportion of 
the total 137 Cs inventory attributable to Chernobyl 
fallout. Because of its short half-life, this approach 
could only be used for a few years immediately after 
the Chernobyl accident. However, scope exists to use 
the plutonium ( 239,240 Pu) content of the current inven¬ 
tory to establish the relative contribution of Chernobyl 
and bomb fallout, since 239 > 240 p u was absent from 
Chernobyl fallout. 

As emphasized by Fig. 2, a key feature of 137 Cs 
fallout is that in most areas of the globe, virtually 


all of this fallout input occurred during the period 
1955-1970, with a peak in 1963. The basis for using 
this radionuclide to obtain estimates of erosion and 
sedimentation rates reflects this pattern of input. By 
studying the subsequent redistribution of the 137 Cs 
fallout in the landscape, it is possible to obtain infor¬ 
mation on soil and sediment redistribution over the 
past ca. 45 years. In addition, the occurrence of peak 
fallout in 1963 provides a valuable time marker in many 
depositional environments. Additional fallout inputs 
associated with the Chernobyl accident may mean 
that the main fallout input occurred in 1986 or if the 
inputs were of a similar magnitude that there were two 
periods of fallout input, one extending from 1955 to 
1970 and the other occurring in 1986. The occurrence 
of the second fallout input can introduce a second time 
marker in sediment deposits which can prove very 
useful in establishing the chronology of a core. 
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Excess Lead-210 

Lead-210 ( 210 Pb, t Vl 22.3 years) is a natural geogenic 
radionuclide that is produced in the Uranium-238 
( 238 U) decay chain. Radium-226 ( 226 Ra, t V2 1,600 
years) is a product of that chain and is found in most 
soils and rocks. This decays to gaseous Radon-222 
( 222 Ra, ty 2 3.8 days) which in turn undergoes a series 
of short-lived decays to 210 Pb. During its short half-life, 
some of the 222 Rn escapes from the soil into the over- 
lying atmosphere, where it decays to 210 Pb, which 
returns to the earth’s surface as fallout. The residence 
time of 210 Pb in the atmosphere is estimated to be only 
a few days. The 222 Rn that remains in the soil also 
decays to 210 Pb and, since this is an in situ product 
of the 226 Ra, it is commonly referred to as supported 
210 Pb. It is possible to calculate the amount of 
supported 210 Pb that should theoretically exist in the 
soil, given the level of 226 Ra, by assuming equilibrium 
between the two radionuclides. However, the actual 
amount is likely to be less, since, under most condi¬ 
tions, some of the 222 Rn would have escaped to the 
overlying atmosphere. Without such escape or exhala¬ 
tion, supply of 222 Rn to the atmosphere and subsequent 
210 Pb fallout would not occur. To distinguish the 210 Pb 
reaching the soil surface as fallout from that produced 
in situ within the soil, the former is commonly referred 
to as unsupported or excess 210 Pb ( 210 Pb ex ). The total 
inventory (Bq m -2 ) of 210 Pb contained in a given depth 
of soil can therefore be partitioned into a supported 
and an unsupported or excess component. The former 
can be estimated from the amount of 210 Pb that would 
be expected if it was in equilibrium with the 226 Ra, 
reduced by a factor reflecting the proportion of the 
222 Rn that escaped to the overlying atmosphere. It is 
not easy to establish this factor empirically, but it can be 
estimated by considering the difference between the 
theoretical estimate of supported 210 Pb activity 
(assuming equilibrium between 226 Ra and 210 Pb) in 
soil or sediment below the depth influenced by surface 
inputs of 210 Pb ex and the measured activity at that 
depth. Since the fallout radionuclide (i.e., 210 Pb ex ) 
is the component of interest for use in documenting 
erosion and sedimentation rates, it is important to direct 
attention to this, rather than the total 210 Pb activity. 
Other sources of 210 Pb fallout include exhalation of 
222 Rn from the surface of the oceans, release of 226 Ra, 


222 Rn, and 210 Pb from volcanic eruptions, and indus¬ 
trial emissions. Because the amount of 210 Pb produced 
over the oceans as a result of release and decay of 222 Rn 
is about two orders of magnitude less than that over 
land, this source is of limited importance for terrestrial 
fallout. The contribution of volcanic eruptions and 
industrial emissions to global fallout of 210 Pb is again 
minimal, but these sources can be locally significant. 

In contrast to 137 Cs, the annual fallout of 210 Pb ex at 
a given location can be viewed as being essentially 
constant in the medium term. If it is assumed that the 
total fallout of 210 Pb ex is retained in the upper horizons 
of the soil of a stable reference site, that experiences 
neither erosion nor deposition, the mean annual fallout 
input can be estimated by assuming a steady-state 
equilibrium between fallout input and loss through 
radioactive decay. There are few longer-term records 
of annual 210 Pb ex fallout involving direct measurements 
that can be used to assess the degree of inter-annual 
variability, but a record from Munich-Neuherberg in 
southern Germany for the period 1981-1999 [56] 
demonstrated a twofold variation in annual fallout, 
with values ranging from 120 to 250 Bq m -2 year -1 , 
with a mean of 180 Bq m -2 year -1 . A similar factor of 
two for the inter-annual variability of 210 Pb fallout has 
been reported by other studies. This inter-annual 
variability can be partly accounted for by variations in 
annual precipitation, but the relationship between the 
annual 210 Pb fallout flux and annual rainfall is not 
always well defined. The distribution of fallout within 
the year will reflect both the precipitation regime 
and seasonal variation of the concentration of 210 Pb ex 
in the atmosphere, as well as the degree of mixing of 
the atmosphere and the volume of air scavenged by 
the rainfall. High concentrations are often found in 
the atmosphere during late spring and summer when 
drier soils and the absence of snow cover increase the 
exhalation of 222 Rn to the atmosphere from the soil 
[57]. Valles et al. [58] report a twofold variation in 
monthly 210 Pb concentrations in air sampled at 
Barcelona, Spain and at this location, concentrations 
were substantially higher in summer and autumn. 
However, equivalent information on air concentrations 
reported for Munich-Neuherberg [56] showed 
a contrasting pattern, with maximum values between 
September and February. At this site, maximum 
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monthly fallout fluxes occurred in the summer months 
between June and August, in response to the higher 
precipitation during these months. 

Less is currently known about the global variation 
of 210 Pb ex fallout than for 137 Cs. Values of annual fall¬ 
out reported in the literature range over more than an 
order of magnitude from <50 Bq m -2 year -1 to in 
excess of 500 Bq m -2 year -1 (see Table 1). In terms of 
the global pattern, available information suggests that, 
although annual fallout fluxes will reflect annual rain¬ 
fall, they are generally lower in the western areas of the 
continents, where the dominant air mass movements 
are off the oceans where very little 222 Rn will be pro¬ 
duced. In contrast, higher annual fluxes are commonly 
found in the eastern areas of continents where the air 
mass tracks pass over large areas of land where 222 Rn is 
produced and concentrations of dust increase. A clear 
latitudinal zonation in annual fallout of 210 Pb has also 
been reported [59], with much lower values being 
documented in the southern hemisphere, due to the 
greater expanse of ocean. The values listed in Table 1 are 
consistent with these general controls. However, it is 
important to note that there may be appreciable local 
variability in annual fallout deposition. Values of 
annual 210 Pb deposition of 59 and 85 Bq m -2 year -1 
in 1990 have been reported for two sites in the southern 
Netherlands that are only 5 km apart [60]. 

Beryllium-7 

Beryllium-7 ( 7 Be t V2 53.3 days) is, like 210 Pb, a naturally 
occurring fallout radionuclide. In this case, its origin is 
cosmogenic and it is produced in the troposphere and 
stratosphere by the cosmic ray spallation of nitrogen 
and oxygen. The production of 7 Be is controlled by the 
cosmic ray flux which varies with altitude, latitude, and 
solar activity [66] . Production is reported to be greatest 
at altitudes between 12 and 20 km, depending on 
latitude, and decreases almost exponentially toward 
the earth’s surface, where it is about three orders of 
magnitude lower [67]. Globally, production increases 
from the equator toward the poles and this increase is 
characterized by a factor of approximately 1-3 for 
the troposphere and 4-5 for the stratosphere [67]. 
Because 7 Be production is influenced by solar activity, 
it has been linked to sunspot numbers and the 11-year 
solar cycle. The increased incidence of sunspots reduces 


Fallout Radionuclides and the Study of Erosion and 
Sedimentation. Table 1 Mean annual deposition of 
210 Pb ex at different locations 


Location 

Mean 210 Pb fallout (Bq m" 2 
year 1 ) 

Tatsunokuchi, Japan 3 

840 

Hokkaido, Japan 

367 

Bombay, India 

250 

Munich, Germany 6 

180 

Galveston, USA C 

172 

New Haven, USA 

153 

Geneva, Switzerland 6 

150 

Delhi, India 

133 

Moscow, Russia 

115 

Bermuda 6 

115 

Calcutta, India 

102 

Darwin, Australia 

95 

Milford Haven, UK 

85 

Suva, Fiji 

80 

Groningen, the 
Netherlands 1 

71 

Alice Springs, Australia 

57 

Sydney, Australia 

53 

Auckland, New Zealand 

50 

East Antarctica 9 

31 


Adapted from Beks et al. [60], with additions. 

a From Turekian et al. [61] 

b From Winkler and Rosner [56] 

c From Yamamoto et al. [62] 

d From Baskaran et al. [63] 

e From Beks et al. [60] 

f From Preiss et al. [59] 

9 From Caillet et al. [64] 

the cosmic ray flux to the earth and thus 7 Be produc¬ 
tion. Because of the longer residence times of aerosols 
in the stratosphere, 7 Be activity remains relatively con¬ 
stant in the stratosphere. However, with lower produc¬ 
tion rates and shorter aerosol residence times due to 
rapid washout, 7 Be concentrations in the troposphere 
are one or two orders of magnitude less than that in the 
stratosphere. Available evidence suggests that >90% of 
7 Be deposition occurs as wet deposition [68]. 
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There are relatively few records of 7 Be fallout to 
provide a detailed basis for defining its characteristics, 
but available evidence indicates that in addition 
to inter-annual variability connected to sunspot activ¬ 
ity, 7 Be concentrations in precipitation and fallout 
fluxes vary seasonally. These variations will reflect the 
incidence of mixing between the stratosphere and tro¬ 
posphere, the altitude of precipitation generation, and 
precipitation amounts. Maximum concentrations and 
fluxes are often recorded in spring and early summer in 
mid-latitudes, in response to increased stratosphere- 
troposphere exchange and increased convection due 
to warming of the land surface. Considerable variability 
in 7 Be concentrations may be associated with individ¬ 
ual rainfall events, with some researchers reporting that 
activities can vary by a factor of greater than 20 between 
events. However, fairly well-defined relationships 
between 7 Be depositional flux and event rainfall are 


commonly found [64]. Because of its short half-life, it 
is more important to understand the magnitude and 
timing of fallout inputs, since these will have a much 
greater effect on the soil inventory than in the case of 
137 Cs and 210 Pb ex , where the inventory of these radio¬ 
nuclides will remain essentially constant in the short 
term and reflect the long-term pattern of fallout. 

As with 210 Pb, there are currently insufficient mea¬ 
surements to obtain a clear picture of the global pattern 
of annual 7 Be deposition. However, available measure¬ 
ments indicate that annual fluxes range between ca. 500 
and 6,500 Bq m -2 year -1 (see Table 2). The influence of 
annual rainfall amount is readily demonstrated by the 
high annual deposition of 6,350 Bq m -2 year -1 
reported for Hokitika in South Island, New Zealand 
[68], where the annual rainfall was ca. 2,800 mm. 
Since most available measurements are from the mid¬ 
latitudes, it is difficult to identify the latitudinal 



Fallout Radionuclides and the Study of Erosion and Sedimentation. Table 2 Annual deposition of 7 Be recorded at 
different locations 


Place 

Mean annual 
precipitation (mm) 

Annual 7 Be fallout 
(Bq m -2 year -1 ) 

Source 

Hokitika, New Zealand 

2,800 

6,350 

Harvey and Matthews [69] 

Tatsunokuchi, Japan 

- 

5,300 

Yamamoto et al. [62] 

New Haven, CT, USA 

1,390 

3,780 

Turekian et al. [70] 

Bermuda 

1,700 

2,850 

Turekian et al. [70] 

Galveston, USA 

1,167 

2,450 

Baskaran et al. [63] 

Geneva, Switzerland 

966 

2,087 

Caillet et al. [64] 

Norfolk, VA, USA 

1,313 

2,075 

Todd et al. [71] 

Milford Haven, UK 

1,328 

1,618 

Peirson [72] 

Bombay, India 

2,277 

1,267 

Lai et al. [73] 

Heidelberg, Germany 

810 

1,249 

Schumann and Stoeppler [74] 

Canberra, South Eastern Australia 

660 

1,030 

Wallbrink and Murray [68] 

Chilton, UK 

822 

898 

Peirson [72] 

Arkansas, USA 

1,070 

867 

Lee et al. [75] 

Westwood, NJ, USA 

787 

717 

Walton and Fried [76] 

Damascus, Syria 

153 

528 

Othman et al. [77] 

East Antarctica 

- 

700 

Nijampurkar and Rao [65] 

Malaga, Spain 

308 

412 

Duenas et al. [78] 
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influence that could be expected to result from the 
latitudinal variation in 7 Be production noted above. 
Because of its cosmogenic origin and the importance 
of production in the stratosphere, 7 Be deposition fluxes 
can be expected to be similar in both the northern and 
southern hemispheres. This contrasts with 137 Cs and 
210 Pb, where the location of the bomb tests and the 
much larger extent of the oceans in the southern hemi¬ 
sphere, respectively, exert an important influence in 
reducing fallout inputs in the southern hemisphere. 

Fallout Radionuclides in the Soil and Soil 
Inventories 

137 Cs, 210 Pb ex , and 7 Be fallout reaches the soil surface as 
positively charged cations, and in most environments, 
the fallout will be rapidly and strongly adsorbed by 
mineral particles and organic material and become 
essentially non-exchangeable [79-83]. Preferential 
adsorption by clay minerals and fine particles has 
been reported in many studies [84]. The strong affinity 
of these radionuclides for the soil is reflected by their 
high partition coefficients (K d ) which are commonly of 
the order of 10 4 -10 5 L kg -1 . Existing evidence suggests 
that whereas 137 Cs is primarily associated with the 
mineral fraction of soils, the organic fraction is also 
important in the uptake of 7 Be and 210 Pb ex . The speed 
of uptake and the kinetics of the adsorption are less well 
defined, but time spans for uptake or adsorption rang¬ 
ing from a few minutes to less than an hour are often 
cited. It is, however, important to recognize that in 
some environments, the fallout radionuclides reaching 
the soil surface may be less strongly fixed. This, for 
example, appears to be case for 137 Cs and peat soils, 
because there are a number of reports of Chernobyl 
fallout being poorly fixed by peat soils and remaining 
mobile within the soil and runoff. 

Where the soil supports a vegetation canopy, this 
will complicate the transfer of the radionuclides to 
the surface soil. In the case of 137 Cs and 210 Pb ex where 
emphasis is usually placed on the longer-term accumu¬ 
lation and fate of the fallout input in the landscape, 
such interception is of limited importance, since the 
radionuclides can be subsequently transferred to 
the soil by leaching and wash-off and by decay when 
the vegetation dies back or sheds its leaves. For, 7 Be, 
however, this interception can be more important, 


since the short half-life of the radionuclide can mean 
that a significant proportion of the fallout input maybe 
intercepted and subsequently lost from the vegetation 
canopy through radioactive decay and therefore never 
reach the soil surface. Work in forested areas in Maine, 
USA, has indicated that as much as 50% of the fallout 
input could be held in the forest canopy [85] and 
a study of 7 Be interception by grass undertaken at 
Bologna, Italy similarly indicated that more than 50% 
of the total inventory was held in the grass layer above 
the soil [86] . Kaste et al. [66] also suggest that soils with 
a grass cover are frequently characterized by a higher 
total areal activity density than bare soils, pointing to 
the role of the grass in trapping atmospheric aerosols. 
This interception has important implications for the 
use of 7 Be as a tracer of soil redistribution, since in areas 
with a vegetation canopy, it is likely to result in consid¬ 
erable local-scale variability in the soil inventory and it 
may prove impossible to establish a reference inventory 
for the soil itself which can be used to identify areas of 
erosion (i.e., reduced 7 Be inventories) and deposition 
(i.e., areas with increased 7 Be inventories). 

Because of the rapid and strong adsorption of the 
fallout input by the surface horizons of the soil, the 
total inventory or areal activity density (Bq m -2 ) mea¬ 
sured at a reference site, defined as a stable undisturbed 
site which has experienced neither erosion nor deposi¬ 
tion, will directly reflect the prior fallout input to the 
site, assuming that the transfer to the soil has not been 
influenced by a vegetation canopy as discussed above. 
Figure 5 provides a schematic representation of the 
variation of that inventory over the timescale relevant 
to the use of a particular fallout radionuclide as a tracer. 
For 137 Cs and 210 Pb, with half-lives extending to 
decades, this timescale covers ca. 50-100 years, whereas 
for 7 Be, with a much shorter half-life, it extends over ca. 
6 months. For 137 Cs (Fig. 5a), the key feature of the 
variation of the inventory over the period of interest is 
the absence of 137 Cs in the soil prior to the early 1950s, 
the subsequent rapid increase in the inventory until the 
mid-1960s, due to the bomb tests during the period 
extending from the mid-1950s until 1963, and the 
subsequent decline in the inventory since the mid- 
1960s, when loss from the inventory due to radioactive 
decay exceeded the residual fallout input from the 
bomb tests. In the absence of new 137 Cs fallout, 
the inventory will continue to progressively decline to 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 5 

A schematic representation of the interaction between the fallout input and the (a) 137 Cs, (b) 210 Pb ex , and (c) 7 Be inventory 
in the soil of a reference site over the timescale relevant to the use of the individual fallout radionuclides as tracers of 
soil and sediment redistribution 


the point where it will fall below the level of detection 
and the radionuclide can no longer be used as a tracer. 
This point is likely to be reached earlier in equatorial 
regions and in the southern hemisphere, where inven¬ 
tories are lower. In locations where an additional input 
of 137 Cs occurred in 1986, as a result of the Chernobyl 
accident, this input would have caused an increase in 
the inventory and a temporary halt to the decline. 
However, since Chernobyl fallout was short-lived, 
the declining trend was soon restored. 

For 210 Pb ex , the pattern of variation of the inventory 
is very different from that of 137 Cs and the inventory 
can be seen as essentially constant through time 
(Fig. 5b), with new fallout inputs balanced by losses 
caused by radioactive decay. For a reference site with 


a mean annual fallout input of 250 Bq m -2 year -1 , the 
steady-state inventory will be ~8,169 Bq m -2 . In real¬ 
ity, there will be small variations in the total inventory 
due to inter-annual variations in the fallout input, but 
since the annual fallout input represents only 
a relatively small proportion of the total inventory 
(i.e., ca. 3%), the effect of these fluctuations will be 
small. In the case of 7 Be, however, the magnitude and 
timing of the fallout inputs will exert a key influence on 
the variation of the inventory through time, due to the 
short half-life of this radionuclide. In the absence of 
further fallout, the inventory would decline to near¬ 
zero within about 6 months and the temporal pattern 
of fallout inputs will dominate the variation of the 
inventory through time. A timescale of ca. 6 months 
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has therefore been used in Fig. 5c to represent the 
key features of the variation of the 7 Be inventory 
through time. 

When considering in more detail the storage or the 
longer-term fate of fallout radionuclides within the 
soil, there is a need to distinguish undisturbed and 
cultivated soils. In the former, redistribution of the 
fallout input within the soil profile will be governed 


primarily by physicochemical diffusion and migration 
processes, including biological activity, whereas in the 
latter case, tillage will commonly mix the upper hori¬ 
zons of the soil and cause the radionuclide activity to be 
relatively uniformly distributed through the plow layer. 
Figure 6 presents typical 137 Cs, 210 Pb ex > and 7 Be depth 
distributions for undisturbed (i.e., permanent pasture) 
and adjacent cultivated area documented by the author 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 6 

Characteristic depth distributions of 137 Cs, 210 Pb ex , and 7 Be in undisturbed pasture soils (left) and cultivated soils 
(right) in Devon, UK. The precision of the radionuclide measurements at the 95% level of confidence is indicated by the 
error bars 
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and his coworkers for a site near Crediton in Devon, 
UK. Depth distributions of similar shape are likely to 
be found in different areas of the world, although the 
total inventory will vary. In Fig. 6, the depth scale has 
been expressed as a mass depth (kg m -2 ), since this 
removes the need to take account of variations in bulk 
density within the upper levels of the soil in influencing 
the precise form of the depth distribution. For 137 Cs, 
the depth distribution for an undisturbed soil shown in 
Fig. 6a is characterized by the typical exponential 
decline in activity with depth that has been widely 
reported for undisturbed soils. In this case, ca. 80% of 
the total inventory is found within the upper 10 cm of 
the soil profile. This depth distribution can be readily 
accounted for by the surface input of 137 Cs as 
fallout and its subsequent downward movement by 
diffusion-like processes. These processes include 
biological activity (e.g., earthworms), which cause 
mixing. In many undisturbed soil profiles, such as 
that illustrated here, the peak activity is found just 
below the surface, rather than at the surface, and this 
can be seen as reflecting the absence of significant 
fallout since the 1980s (in the absence of Chernobyl 
fallout), biological activity moving soil from depth to 
the surface, and the accumulation of dead organic 
material at the surface. The depth distribution of 
210 Pb ex for an undisturbed soil, illustrated in Fig. 6b, 
is, as might be expected, very similar to that of 137 Cs. 
However, in this case, the fallout input is continuous 
and the maximum activity is commonly found at 
the surface, where it is maintained by the fresh fallout. 
The depth distribution of 7 Be associated with an 
undisturbed soil shown in Fig. 6c differs substantially 
from that of the other two fallout radionuclides, in 
that all the inventory is contained within the upper 
ca. 10 mm of the soil and most is within the upper 
5 mm. This situation reflects the continuing input of 
7 Be fallout to the surface and the short half-life of 
the radionuclide, which means that any 7 Be slowly 
moving deeper into the soil disappears quite rapidly 
due to decay. 

In the case of 137 Cs and 210 Pb ex , the depth distribu¬ 
tions illustrated for an undisturbed soil in Fig. 6 can be 
viewed as essentially constant over a timescale of several 
years. The 210 Pb ex profile is likely to remain constant 
over a longer timescale, as the fallout input and down¬ 
ward movement are balanced by decay. In the longer 


term, the 137 Cs depth distribution would have changed, 
through time, reflecting the beginning of the fallout 
input in the 1950s, when most of the radiocesium 
would have been found near the surface, the subse¬ 
quent progressive downward movement of the radio¬ 
nuclide, the cessation of significant fallout in the 1980s, 
and the gradual reduction in the total inventory after 
the mid-1960s, when loss by decay started to exceed the 
input of new fallout. Existing experimental evidence, 
based on the artificial application of radiocesium to 
bare soil surfaces, indicates that fresh fallout is initially 
distributed within the top few millimeters of the soil. 
This situation is essentially replicated by the 7 Be depth 
distribution, since decay removes the evidence of 
medium-term downward movement. Because of its 
short half-life, the 7 Be depth distribution will be more 
dynamic, reflecting short-term changes in the total 
inventory in response to fresh fallout and its subse¬ 
quent redistribution within the shallow upper soil 
layer, as well as ongoing decay. Figure 7 illustrates the 
evolution of the depth distribution of 7 Be within a bare 
undisturbed soil in an area of cleared forest near Val¬ 
divia in southern Chile, from summer through the 
wet season (autumn and winter) to the following 
spring [87]. Over the period of measurement, the 
total inventory varies over nearly an order of magni¬ 
tude from 181 to 1,401 Bq m -2 . In Fig. 7, the depth 
distributions are represented by the total inventory 
recorded above a given depth and the individual distri¬ 
butions show a clear exponential form. The h 0 values 
listed represent the relaxation depths of the exponential 
distributions and increased values reflect greater 
penetration. During the initial period, which represents 
the end of the dry season, the inventory declines, but it 
subsequently increases at the onset of the wet season. 
At this stage, the new input of fallout to the surface 
causes an increase in the inventory held close to the 
surface and the h 0 values decline. As the wet season 
continues, however, the depth distribution deepens 
and the h 0 values increase to reflect the greater 
proportion of 7 Be held at depth. 

As shown in Fig. 6, the depth distributions of 137 Cs 
and 210 Pb ex in cultivated soils are dominated by the 
influence of soil mixing caused by tillage. The profiles 
are characterized by a fairly uniform radionuclide 
activity in the plow layer, which is well mixed by tillage, 
but the activity rapidly declines below the plow depth. 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 7 

Depth distributions of 7 Be recorded in a bare forest soil at a reference site near Valdivia in southern Chile over the seasonal 
cycle. The inventories (A ref ) and relaxation depths ( h 0 ) associated with the individual depth distributions are listed in 
the key (Based on [87]) 


Most profiles evidence a small “tail” extending below 
the plow depth, reflecting some limited downward 
movement of the radionuclide to greater depth. The 
137 Cs and 210 Pb ex profiles for cultivated soils portrayed 
in Fig. 6 were taken from eroding areas within a culti¬ 
vated field, and the total inventories are less than those 
associated with the equivalent cores from an adjacent 
area of undisturbed grassland. The reduced inventories 
reflect the progressive loss of soil containing 137 Cs or 
210 Pb ex from the surface of the profile by erosion. As 
erosion proceeds, both the inventory and the 137 Cs 
activity in the plow layer will decline, as plowing incor¬ 
porates subsoil containing little 137 Cs or 210 Pb ex into 
the plow layer. In the absence of erosion, the depth 
distribution would be characterized by a total inven¬ 
tory similar to that associated with the undisturbed 
stable site. The shape of the profile would also be 
similar to those presented in Fig. 6 for cultivated soils, 
but the activity within the plow layer would be greater. 

In contrast to the 137 Cs and 210 Pb ex profiles 
depicted in Fig. 6, the depth distribution of 7 Be for 


the cultivated field is almost identical to that recorded 
for the undisturbed area. The reason is that the 7 Be 
fallout accumulates in the upper few millimeters of the 
soil and when the field is plowed, the action of mixing 
the accumulated 7 Be into the plow layer reduces the 
activity to below the level of detection. After plowing 
and cultivation, new fallout will again accumulate near 
the surface and as time progresses and the inventory 
grows, the depth distribution will closely resemble the 
depth distribution associated with the undisturbed 
area. The reduced 7 Be inventory associated with the 
depth distribution for the cultivated field relative to 
the adjacent undisturbed field could reflect the timing 
of the prior plowing and the limited time available for 
the inventory to be restored. However, in this instance, 
the plowing occurred many months previously and the 
reduced inventory reflects the loss of surface soil by 
erosion. This situation is confirmed by the reduced 
depth to which 7 Be is found. 

Where soil redistribution within the landscape 
caused by erosion produces areas of deposition, the 
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depth distributions of the fallout radionuclides associ¬ 
ated with such depositional areas will differ from those 
for stable and eroding areas presented in Fig. 6. Assum¬ 
ing that the mobilized sediment which is subsequently 
deposited contains 137 Cs, 210 Pb ex , or 7 Be, by virtue of its 
source within the landscape, the deposition of sedi¬ 
ment, for example at the foot of a slope or on a river 
floodplain, will cause the inventory to increase and the 
depth distribution to change due to upward extension 
and thus an increase in the depth of soil or sediment 
containing the radionuclide. Figure 8 provides some 
examples of such changes associated with a river flood- 
plain site in Devon, UK. In this case, the depth distri¬ 
butions were documented for both a point on the 
river floodplain and an adjacent area of undisturbed 
pasture above the level of inundation, with no evidence 
of either erosion or deposition. In both cases, the soils 
were uncultivated and the profiles are therefore 
undisturbed by tillage mixing. In the case of the 137 Cs 
depth distributions, the profile from the floodplain has 
an inventory that is considerably greater than the pro¬ 
file from the area above the level of inundation, as 
a result of deposition of sediment containing 137 Cs. 
The profile shape also provides clear evidence of depo¬ 
sition. The shape of the 137 Cs depth distribution shown 
for the floodplain reflects the progressive reduction of 
the 137 Cs content of the sediment sources within the 
catchment, and thus also of the deposited sediment, as 
a result of the ongoing loss of 137 Cs through erosion. 

As with 137 Cs, the total inventory associated with 
the 210 Pb ex depth distribution from the depositional 
site on the floodplain shown on Fig. 8 is substantially 
greater than that of the 210 Pb ex profile for the 
undisturbed site above the level of inundation. This 
again reflects the input of 210 Pb ex to the floodplain 
profile from both fallout to the floodplain surface and 
the deposition of sediment containing 210 Pb ex . The 
depositional environment is again further confirmed 
by the substantially greater depth to which 210 Pb ex is 
found in the floodplain profile. Since the fallout input 
of 210 Pb ex to both the upstream catchment and the 
floodplain surface is essentially continuous, there is 
no reduction in activity toward the surface of the 
floodplain profile. The rate of decrease of the 210 Pb ex 
activity with depth will reflect the rate of sediment 
accretion and the rate of decay of the 210 Pb ex contained 
in the buried sediment. Since the latter is known, it 


is possible to estimate the rate of sediment accretion 
from the rate of decrease of activity with depth in the 
profile (see section “Estimating Sedimentation Rates 
and Establishing Chronologies in Depositional 
Sinks”). Comparison of the two 7 Be depth distribu¬ 
tions presented in Fig. 8 again provide clear evidence 
of deposition, as reflected by the increased inventory 
and depth of the 7 Be profile from the accreting flood- 
plain, when compared with the profile from the site 
above the level of inundation. Both these features of the 
floodplain profile indicate that the deposition occurred 
only a short time prior to the time that the 7 Be depth 
distributions were documented, since in the absence 
of new sediment deposition, any difference between 
the two 7 Be profiles would soon disappear. The higher 
7 Be surface activity associated with the floodplain 
profile reflects the two sources of 7 Be input to the 
surface, namely, direct fallout and deposition of sedi¬ 
ment containing 7 Be mobilized from the upstream 
catchment. 

The fallout radionuclide profiles from 
a depositional site illustrated in Fig. 8 relate to an area 
of undisturbed permanent pasture on a river flood- 
plain. Similar profiles will be found in other deposi¬ 
tional sinks, such as areas of deposition at the foot of 
a slope, where the soil surface will receive both direct 
fallout and an input of radionuclide-bearing sediment 
eroded from upslope. In many instances, depositional 
sites will also be cultivated. In cultivated areas, the 
depth distribution of 137 Cs and 210 Pb ex associated 
with depositional areas will again be characterized by 
increased inventories and the occurrence of the two 
radionuclides to greater depths. However, the action 
of tillage will commonly introduce greater homogene¬ 
ity into the depth profile and in many cases, the mag¬ 
nitude of the increase in inventory rather than the 
shape of the depth distribution will provide the key 
evidence as to the rate of deposition. 

Using Fallout Radionuclides as Tracers to 
Investigate Soil and Sediment Redistribution 
within the Landscape 

A sound understanding of the input and redistribution 
of 137 Cs, 210 Pb ex , and 7 Be within the landscape 
provides the key to using these three radionuclides to 
estimate rates of soil and sediment redistribution. 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 8 

Characteristic depth distributions of 137 Cs, 210 Pb e x, and 7 Be from an undisturbed pasture site (left) and from a neighboring 
regularly inundated river floodplain (right) in Devon, UK. The precision of the radionuclide measurements at the 95% 
level of confidence is indicated by the error bars 


Their distinctive behavior at erosional and depositional 
sites, in particular, provides a basis for quantifying 
erosion and deposition rates and discriminating fine 
sediment sources. Further discussion of the principles 


involved will consider: firstly, quantification of soil 
redistribution within the landscape by soil erosion, 
secondly, the estimation of sedimentation rates in 
depositional sinks such as lakes, reservoirs, and river 
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floodplains and establishing chronologies for the asso¬ 
ciated sediment deposits, and thirdly, discriminating 
the potential sources of fine sediment in catchments 
and river basins. In each case, there is a need to take 
account of differences between the three fallout radio¬ 
nuclides in terms of the temporal pattern of fallout and 
the timescale of interest and the consequent need for 
different approaches. 

Quantification of Soil Redistribution Within the 
Landscape by Soil Erosion 

As demonstrated by Figs. 6 and 8, both the total inven¬ 
tory and the depth distribution of 137 Cs, 210 Pb ex , and 
7 Be in soil or sediment can provide information on past 
soil redistribution at that point. Because the inventory 
can generally be obtained with a single measurement, 
whereas a substantial number of measurements will be 
needed to establish the depth distribution, and the 
number of samples that can be measured is frequently 
limited by detector availability and the count times 
involved (see section “Laboratory Procedures and 
Techniques”), attention usually focuses on inventory 
measurements. In most studies, the approach taken 
involves establishing a reference inventory, which rep¬ 
resents the inventory associated with an undisturbed 
location experiencing neither erosion nor deposition, 
and comparing the inventories obtained for other sam¬ 
pling points with this value. Where they are less than 
the reference inventory, it is assumed that erosion has 
occurred and, where they exceed the reference inven¬ 
tory, this is assumed to reflect deposition. In the case of 
137 Cs, the soil redistribution resulting in the reduced or 
increased inventory is assumed to have occurred over 
the period since the main period of 137 Cs fallout in the 
late 1950s and early 1960s, whereas for 210 Pb ex and 7 Be, 
the period is defined by considering the half-life of the 
radionuclide and the extent to which the current inven¬ 
tory will reflect past fallout and redistribution of that 
fallout. It is important that this comparison should 
take account of the uncertainty associated with both 
the precision of the laboratory measurements of the 
radionuclide activity used to establish the inventories 
of the sampling points and any uncertainty associated 
with the value used for the reference inventory. The 
latter will reflect both the measurement precision and 
the uncertainty introduced by small-scale spatial 


variability of the soil inventory and the sampling pro¬ 
cedure (see section “Field Techniques for Sample 
Collection”). 

In a few studies involving 137 Cs, a different repeat 
sampling approach has been employed and soil redis¬ 
tribution rates are estimated by comparing the inven¬ 
tories measured on two occasions at the individual 
sampling points [88]. The principle involved is very 
similar to the use of a reference inventory, in that 
a reduction in inventory over time is assumed to reflect 
erosion, whereas an increase is taken to indicate depo¬ 
sition. Although this alternative approach has the 
advantage that the period during which soil redistribu¬ 
tion processes have been operating to cause the chang¬ 
ing inventory is precisely defined and the need to 
establish the local reference inventory is removed, it 
also has several important limitations which are likely 
to restrict its application. The first is the need for repeat 
measurements and thus the inability to obtain results 
quickly and without preplanning. Studies where the 
approach has been used have generally been able to 
make use of data fortuitously provided from earlier 
sampling programs, but these will rarely be available. 
Secondly, there is a need to ensure that the period 
elapsed between the two surveys is of sufficient dura¬ 
tion to produce a significant reduction or increase in 
the inventory, bearing in mind the uncertainty associ¬ 
ated with the measured inventories, which will reflect 
both analytical precision and variability introduced by 
sampling. Finally, relocating sampling points may 
prove difficult. Even in areas where erosion and depo¬ 
sition rates are relatively high, it has been estimated 
that a period of 10 years between sampling campaigns 
may be required to ensure that there is a significant 
difference between the two sets of measurements. 
Where soil redistribution rates are relatively low, the 
length of the period will increase. This contribution 
will focus on the standard approach using a reference 
inventory. 

Use of 137 Cs Measurements Figure 9 provides a sche¬ 
matic representation of the principles involved in using 
137 Cs measurements to estimate soil redistribution 
rates. Here, an orchard is used as a reference site and 
the cores used to define the reference inventory and the 
characteristic depth distribution are collected between 
the trees where the presence of the trees will have had 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 9 

A schematic representation of the principles involved in the use of 137 Cs measurements to estimate soil redistribution 
rates 


little or no influence on fallout receipt. Eroding areas 
within both pasture and cultivated areas will be char¬ 
acterized by reduced inventories. Similarly, areas of 
deposition are reflected by increased inventories. The 
depth distributions provide further evidence of the 
incidence and magnitude of erosion and deposition. 
As indicated above, the soil redistribution reflected by 
the departure of the measured inventory values 
from the reference value is assumed to have occurred 
over the period between the main period of 137 Cs 
fallout in the late 1950s and 1960s and the present. 
Although information on 137 Cs inventories and depth 


distributions, such as that shown in Fig. 9, provides 
qualitative information on the magnitude and spatial 
distribution of soil redistribution rates within the 
landscape, for most purposes, there will be a need for 
quantitative estimates of soil redistribution rates. 
To obtain such estimates, it is necessary to apply what 
have become known as “conversion models” to the 
measured inventory values. 

Reviews of the various conversion models used with 
137 Cs measurements (e.g., [89-91]) commonly distin¬ 
guish between empirical models and theoretical models. 
Empirical models, as might be expected, are based on 
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the use of empirical measurements to establish the 
relationship between the magnitude of the loss or 
gain of 137 Cs inventory relative to the reference inven¬ 
tory and the soil redistribution rate. Such data are 
difficult to obtain, but there have been some attempts 
to use data from a range of erosion plots for this 
purpose, as in Australia, where relationships between 
rate of soil erosion and the reduction in 137 Cs inventory 
have been established for both cultivated and 
uncultivated soils [92]. In this case, the measured soil 
loss from the plots Y (t ha -1 year -1 ) was related to the 
reduction in 137 Cs inventory within the plot, X (%), 
using a function of the form 

Y=aX b (1) 

However, the development and use of such empir¬ 
ical models involves several important problems. Per¬ 
haps most importantly, any relationship developed will 
be time specific. This constraint is particularly impor¬ 
tant. For example, loss of 20% of the inventory by 1980 
would reflect a much higher erosion rate than loss of 
20% of the inventory by 2000 and a relationship 
established using data for 1980 would overestimate 
erosion rates if applied to recent measurements of the 
reduction in inventory. Equally, it is difficult to use this 
approach to estimate deposition rates, although, in 
some investigations, it has been assumed that the equa¬ 
tion can simply be rearranged using the same constant 
and coefficient to estimate deposition. 

Because of the lack of data that can be used to 
establish empirical models and the inherent limitations 
noted above, most of the conversion models that have 
been successfully used for estimating soil redistribution 
rates from 137 Cs measurements fall into the second 
group, namely, theoretical models. By definition, such 
models use existing understanding of the behavior and 
fate of fallout inputs in response to soil redistribution 
processes to derive a theoretical relationship between 
soil redistribution rate and the loss or gain of inventory 
relative to the reference inventory. The proportional 
model [21, 93] provides a very simple example of 
such a model that has been quite widely used for 
cultivated soils. In this case, the plow depth H (m) 
is determined and the depth of soil lost, as a proportion 
of H, is estimated by simply assuming that this propor¬ 
tion will be the same as X (%), the proportion 
of the inventory that has been lost. If the bulk density 


of the soil B (kg m -3 ) and the period over which 
erosion T (years) has occurred are known, the depth 
of soil lost can be used to estimate the erosion rate Y, 
that is, 


Although attractive in its simplicity, the basic 
assumptions underlying the proportional model are 
flawed, since it takes no account of the fact that as soil 
is eroded from the surface of the plow layer in 
a cultivated field, the depth of the plow layer will be 
maintained by incorporation of subsoil from below the 
original plow layer. Therefore, if a depth equivalent to 
the depth of the plow layer was removed by erosion, the 
soil would still contain a significant proportion of the 
original inventory and not an inventory of zero. Use of 
the proportional model will therefore underestimate 
erosion rates, and it has been shown [91] that with 
high rates of soil loss (e.g., loss of ca. 60% of the 
original inventory), the estimated erosion rate is likely 
to underestimate the erosion rate by ca. 40%. The 
simple assumptions underlying the proportional 
model relate to an eroding soil and the model does 
not explicitly provide a means of estimating deposition 
rates, where the measured inventory exceeds the refer¬ 
ence inventory. However, in several studies, the equa¬ 
tion has been extended to estimate the deposition rate, 
by again assuming that the depth of deposition repre¬ 
sents a proportion of H, equivalent to the percentage 
increase in inventory, relative to the reference inven¬ 
tory. Use of this procedure is, however, likely to under¬ 
estimate the deposition rate, since the deposited soil 
eroded from upslope and deposited at a sampling point 
is likely to be characterized by a lower 137 Cs activity 
than that assumed by the model. 

Other attempts to develop conversion models have 
aimed to provide a more realistic representation of the 
behavior of 137 Cs in the soil at locations characterized 
by erosion and deposition [90, 91] . In providing a brief 
overview of some of these models, it is useful to distin¬ 
guish between those developed for cultivated soils and 
those developed for uncultivated soils, for example, 
soils found on pasture or rangeland. In most circum¬ 
stances, reduction of the soil inventory by a given per¬ 
centage will reflect a much higher erosion rate for 
a cultivated soil, than for an uncultivated soil, since 
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the incorporation of the 137 Cs inventory into the plow 
layer means that a much greater depth of soil will need 
to be eroded to reduce the soil inventory by a given 
amount. For an uncultivated soil, most of the 137 Cs 
inventory is likely to be found close to the surface and 
reduction of the inventory by a given proportion is 
likely to reflect a much lower erosion rate than for 
a cultivated soil. 

Since they have attracted most attention in terms of 
conversion model development, attention will be 
directed first to the use of theoretical conversion 
models developed for use with cultivated soils. Most 
of these models, which aim to overcome the funda¬ 
mental inadequacy of the proportional model, use 
a mass balance approach and attempt to model the 
changes in the 137 Cs content of the plow layer in 
response to fallout inputs, radioactive decay, loss of 
137 Cs from the surface by erosion, and incorporation 
of soil containing no 137 Cs from beneath the plow 
depth by ongoing plowing, over the period since 
the commencement of 137 Cs fallout. By running the 
model with a range of erosion rates, it is possible to 
establish a relationship between erosion rate and the 
degree of reduction of the 137 Cs inventory relative to 
the reference inventory. The basic form of the mass 
balance model for an eroding site can be represented 
as follows: 



where A(t) = cumulative inventory (Bq m -2 ), t - time 
since onset of 137 Cs fallout (years), R = erosion rate 
(kg m -2 year -1 ), d m - average plow depth expressed as 
a mass depth (kg m -2 ), X - decay constant for 
137 Cs (year -1 ), and I(t) = annual deposition flux at 
time t (Bq m -2 year -1 ). 

For many locations, detailed information on the 
annual 137 Cs deposition is unlikely to be available 
and in this situation, the fallout record from a repre¬ 
sentative station in the same hemisphere (see Fig. 2) 
can be scaled to match the measured reference inven¬ 
tory for the study site. In order to estimate deposi¬ 
tion rates R r (kg m -2 year -1 ) using the same mass 
balance approach, there is a need to take account of 
the magnitude of the increase in the 137 Cs inventory. 
In this case, the deposition rate will reflect the 137 Cs 


activity in the deposited sediment, C d (t') (Bq kg : ) 
according to: 


f C d (t')e-M t - t ')dt / 

to 

where C d (t f ) = the 137 Cs activity of deposited sediment 
in year t' (Bq kg -1 ). 

The 137 Cs activity of deposited sediment can be 
estimated as the weighted mean 137 Cs activity of sedi¬ 
ment mobilized from the upslope contributing area 
(i.e., from the data provided by eroding points 
upslope). 

The general framework provided by Eqs. 3 and 4 has 
been extended by various workers to take account of 
other processes and factors that can be expected to 
influence the relationship between the erosion rate 
and the percentage reduction in the inventory relative 
to the reference inventory (e.g., [90, 94, 95]). These 
include the fate of fresh fallout which can be removed 
by erosion prior to incorporation into the plow layer by 
tillage, the particle size selectivity of erosion and depo¬ 
sition processes and the preferential association of 
137 Cs with the finer fractions of the soil and with 
organic matter, and the role of tillage erosion in 
influencing 137 Cs redistribution. In the latter case, it 
has proved possible to develop conversion models that 
permit separation of water erosion and tillage erosion 
and provide separate estimates of each. In a overview of 
the conversion models available for cultivated soils, 
Walling and He [90, 91] provide further details of 
a simple mass balance model which assumes that all 
fallout was received in 1963, a more comprehensive 
mass balance model incorporating the fate of fresh 
fallout and the size selective nature of erosion and 
deposition and the further extension of that model to 
incorporate the effects of 137 Cs redistribution by tillage. 

In the case of uncultivated soils, the conversion 
model needs to take account of the shape of the depth 
distribution, since this will influence the relationship 
between erosion rate and the depletion of the total 
137 Cs inventory. Two main approaches have been used 
in developing models for uncultivated soils [91]. The 
first, which is often termed the profile distribution 
model, is based on the assumption that the 137 Cs 
depth distribution in most uncultivated soils is 
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exponential in form and can be characterized by 
a relaxation depth h 0 (kg m -2 ). The greater the value 
of h 0 the greater the depth to which the 137 Cs extends 
into the soil. If the value of h 0 is determined for the 
reference site and it is assumed that at eroding sites, loss 
of surface soil will cause the exponential depth distri¬ 
bution to be truncated, comparison of the inventory at 
the sampling point with the reference inventory pro¬ 
vides a means of estimating the depth of soil which 
must have been removed to reduce the inventory by the 
documented amount. The erosion rate can be esti¬ 
mated by dividing the total depth of soil removed by 
the number of years elapsed since 1963, the year of peak 
fallout, namely, 


Y — 


10 


t - 1963 


-In 



ho 


( 5 ) 


where t = the year of sample collection. 

Where the measured inventory at a sampling point 
exceeds the reference inventory, indicating that depo¬ 
sition has occurred, the deposition rates can be esti¬ 
mated from the magnitude of the increase in inventory 
and the 137 Cs concentration of deposited sediment, 
using the same approach as used for cultivated soils 
in Eq. 4. 

Use of the profile distribution model assumes that 
the exponential depth distribution found at the refer¬ 
ence site would have developed at the eroding point by 
1963 and was subsequently truncated by erosion. 
This is clearly an oversimplification, since the depth 
distribution will evolve through time as a result of 
the time-variant fallout input, the progressive down¬ 
ward movement of the 137 Cs fallout received at the 
soil surface, and the removal of some fresh fallout 
from the soil surface by erosion shortly after its depo¬ 
sition. As a result, estimates of rates of soil loss pro¬ 
vided by the profile distribution model are likely to 
overestimate erosion and deposition rates. However, 
a study aimed at testing the model within a small 
catchment in southern Italy [96], by comparing the 
measured sediment yield from the catchment with 
the estimate of mean annual net soil loss provided by 
the application of the profile distribution model to 
137 Cs measurements obtained from the catchment, 
demonstrated good agreement between the two inde¬ 
pendent results. 


Uncertainties regarding the assumptions implicit in 
the profile distribution model have encouraged the 
development of an alternative approach, commonly 
referred to as the diffusion and migration model [91]. 
This model involves a mass balance approach taking 
account of the time-variant fallout input and the slow 
downward movement of the 137 Cs fallout received at 
the surface. As such, it takes account of the evolution of 
the 137 Cs depth distribution through time, rather than 
assuming a constant form. This downward movement 
is governed by two lumped parameters loosely referred 
to as the diffusion and migration coefficients, which 
can be estimated from the form of the depth distribu¬ 
tion found at the reference site. With this model, it is 
again possible to estimate the deposition rate associ¬ 
ated with a sampling point where the inventory exceeds 
the local reference inventory. The approach taken is 
similar to that represented in Eq. 4. Since this model 
more closely replicates existing understanding of the 
post-fallout redistribution of 137 Cs in undisturbed soils 
and its interaction with erosion, it is likely to provide 
more reliable estimates of soil redistribution rates in 
areas with uncultivated soils. 

Although most studies employing 137 Cs to investi¬ 
gate erosion and soil redistribution rates have focused 
on water erosion, there have been some studies that 
have attempted to extend its application to wind ero¬ 
sion. These studies have employed broadly the same 
approach of comparing the reference inventory at an 
undisturbed site unaffected by erosion or deposition 
with the inventories measured at a range of sampling 
points. Points with reduced inventories are assumed to 
represent eroding points, and points with increased 
inventories are assumed to represent points where 
deposition has occurred. In most instances, conversion 
models based on those developed for water erosion 
have been used, since the mechanisms involved are 
similar, that is, removal of soil from the surface and 
addition of sediment to the original surface by deposi¬ 
tion. As with water erosion, a range of different 
approaches have been used in the models. These 
include the proportional model [97, 98], mass balance 
models [99, 100], and a profile distribution model 
[100, 101]. However, the available evidence suggests 
that wind erosion introduces greater complexity into 
the post-fallout redistribution of 137 Cs and the use of 
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137 Cs measurements to estimate soil and sediment 
redistribution rates. Important issues include the 
potential importance of selective winnowing of fines 
by deflation processes and the possibility that reference 
sites could be influenced by dust accumulation, which 
could increase the measured inventory [102]. 

As indicated above, estimates of soil redistribution 
rates obtained using 137 Cs measurements generally 
relate to the period extending from either the onset or 
the peak period of fallout in the 1950s or 1960s to the 
time of sampling. In many respects, the lengthy period 
covered by the measurements represents an advantage 
of the 137 Cs approach, because it integrates the natural 
temporal variability of soil erosion rates. However, it 
can also represent a limitation of the approach, in that 
it generates an average erosion rate for the period and it 
is not possible to document changes in erosion rates 
resulting from land use change. However, where land 
clearance for agriculture and the onset of significant 
erosion postdates the main period of 137 Cs fallout, the 
conversion models can be modified to provide an 
estimate of soil redistribution relating only to the 
post-clearance period. In another specialist applica¬ 
tion, it proved possible to produce estimates of erosion 
rates associated with two contrasting land management 
practices that characterized the period of interest 
[103, 104]. In this case, it was possible to reconstruct 
the erosion rates occurring during a period of conven¬ 
tional cultivation extending from 1954 to 1986 and 
a subsequent period under no-till management 
extending from 1986 to 2003. By measuring the current 
137 Cs activity of the former plow layer associated with 
the period of conventional tillage, it was possible to 
reconstruct the inventory of a sampling point at the 
end of this period and thus estimate the erosion rate 
under conventional tillage using a mass balance con¬ 
version model. Similarly, the current depth of the orig¬ 
inal plow layer was estimated from measurements of 
the current total 137 Cs inventory and the current 137 Cs 
activity of the plow layer at individual sampling points 
and this value was compared with the plow depth 
during the period of conventional cultivation to pro¬ 
vide an estimate of the depth of soil eroded during the 
period under no-till management. To use this 
approach, it is important that the overall period cov¬ 
ered by the 137 Cs measurements (i.e., ca. 50 years) 
should include substantial periods under both 


management practices and thus that the change 
in management should coincide approximately with 
the middle of the period covered by the 137 Cs 
measurements. 

Use of 210 Pb ex Measurements The use of 210 Pb ex 
measurements to investigate rates and patterns of soil 
redistribution within the landscape involves similar 
principles to those involved in the use of 137 Cs and 
shown schematically in Fig. 9. As with 137 Cs, the inven¬ 
tories and depth distributions documented at individ¬ 
ual measuring points can be compared with equivalent 
information obtained from an undisturbed reference 
site and a conversion model is used to estimate soil 
redistribution rates. The key differences between the 
use of 137 Cs and 210 Pb ex measurements relate to the 
contrasting nature of the fallout input and the shorter 
half-life of 210 Pb ex (22.3 years vs 30.17 years). The 
essentially continuous nature of 210 Pb ex fallout means 
that, unlike the situation with 137 Cs, there is no 
well-defined period (e.g., from the commencement of 
fallout to the time of sampling) for which soil redistri¬ 
bution rates are documented. The 210 Pb ex inventory 
measured at a sampling point will reflect the interac¬ 
tion between the fallout input and soil redistribution 
processes over an extended period. This period will 
reflect the half-life of the radionuclide, such that the 
impact of past soil redistribution on the present day 
inventory will become progressively less as the period 
extends back into the past. It is frequently indicated 
that 210 Pb ex measurements can be used to estimate the 
erosion occurring over a period representing about 
five half-lives and thus over the past ca. 100 years. 
In principle, this is correct, but it must be recognized 
that erosion occurring 100 or even 50 years ago will 
have much less influence on the current inventory 
than erosion occurring in the past 10 or 20 years. 
Thus, although 210 Pb ex measurements will reflect ero¬ 
sion and soil redistribution taking place over a longer 
period than 137 Cs, the measurements are likely to be 
more sensitive to recent erosional activity than 137 Cs 
measurements. 

Walling and his coworkers [38,105] describe several 
conversion models which have been developed for 
deriving estimates of soil redistribution rates, based 
on the degree of reduction or increase in the 210 Pb ex 
inventory relative to the reference value. These models 
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are very similar to several of those developed for 137 Cs 
and described above, the key difference being the need 
to incorporate a continuous and essentially constant 
annual fallout input, rather than fallout input over 
a limited period. The mass balance model for cultivated 
soils incorporates many of the refinements associated 
with the comprehensive mass balance model for 137 Cs 
described by the same authors. These include size selec¬ 
tive sediment mobilization and deposition and the fate 
of fresh fallout prior to its incorporation into the plow 
layer by tillage. The mass balance model has also been 
extended to incorporate the effects of soil displacement 
by tillage. Because 210 Pb ex fallout is continuous and 
essentially constant through time, it is particularly 
important that the fate of fresh fallout, prior to incor¬ 
poration into the soil by tillage, should be represented 
in a physically realistic manner, since this will exert 
a key control on the 210 Pb ex inventory. If erosion occurs 
whilst the recent fallout remains at the surface, 
a relatively small amount of erosion could remove 
a significant proportion of the fresh input, whereas 
once the fresh fallout is mixed into the plow layer, the 
same amount of erosion will remove a much smaller 
amount of 210 Pb ex . 

Because the 210 Pb ex fallout is continuous, it is not 
possible to use a conversion model equivalent to the 
profile distribution model developed for 137 Cs, when 
investigating soil redistribution on uncultivated 
soils. That model assumes that the depth distribution 
is established during the main period of 137 Cs fallout 
and is subsequently truncated by surface erosion. 
With 210 Pb ex , the continuous fallout will replenish 
the 210 Pb ex removed from the upper part of the soil 
by erosion and the depth distribution is likely to 
achieve a quasi steady state. Under these circumstances, 
a model very similar to the diffusion and migration 
model developed for 137 Cs has been developed. 

A different approach to estimating soil erosion rates 
using 210 Pb ex measurements is described by Wallbrink 
and Murray [36]. This approach, which was applicable 
to uncultivated soils combined measurements of both 
137 Cs and 210 Pb ex and was recommended for use in 
areas, such as forests, where local variability of 137 Cs 
fallout made it difficult to establish a 137 Cs reference 
inventory. In this case, attention was directed to the 
contrasting depth distributions of the two fallout radio¬ 
nuclides, with 137 Cs evidencing deeper penetration 


than 210 Pb ex in the study area in Tasmania, Australia. 
The different penetration depths resulted in a change in 
the ratio between the 210 Pb ex inventory (below 
a specific depth in the profile) and the inventory of 
137 Cs below the same depth. The ratio decreased mono- 
tonically with depth, down to ca. 8 cm, with values of 
>2.0 near the surface and <1.0 at depths of ca. 4 cm. 
The ratio provides a unique measure of the depth 
within the profile and can be used to estimate the 
depth of erosion that has occurred at a particular 
point, either over the period since the occurrence of 
bomb fallout or since erosion commenced as a result 
of land clearance or disturbance, if this occurred later. 
Once the depth distribution of the 210 Pb ex / 137 Cs ratio 
is established for the study site using an undisturbed 
reference site, bulk cores are collected from individual 
sampling points and measurement of the 137 Cs and 
210 Pb ex inventories permits the ratio to be calculated 
and the depth of soil eroded to be estimated. The 
approach is in many respects similar to the use of the 
profile distribution model to estimate the depth of 
erosion on an uncultivated soil from measurements of 
the 137 Cs inventory undertaken on bulk cores. In that 
case, the exponential depth distribution of 137 Cs pro¬ 
vides a means of estimating the depth of soil removed 
by erosion, whereas with this approach, the depth dis¬ 
tribution of the ratio provides the means of estimating 
the depth of soil removed from the inventory ratios 
calculated for individual bulk cores. 

Use of 7 Be Measurements In contrast to 137 Cs and 
210 Pb ex , the value of 7 Be for documenting soil redistri¬ 
bution rates reflects its short half-life (53 days) and its 
potential for obtaining information relating to individ¬ 
ual events or short periods of heavy rainfall (e.g., 1-2 
weeks). However, although there is increasing interest 
in this potential, relatively little work has been under¬ 
taken to date using this fallout radionuclide. One early 
attempt to use 7 Be to estimate soil redistribution rates 
employed a mass balance approach, analogous to the 
repeat monitoring approach developed for use with 
137 Cs [35]. In this approach, the 7 Be inventory was 
documented for a number of sampling points imme¬ 
diately prior to the event or short study period. At 
the end of the event or period of interest, the 7 Be 
inventory was measured again at adjacent points. The 
7 Be fallout input during the intervening period was 
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estimated using a rainfall collector. If the period stud¬ 
ied is sufficiently short to permit the effects of radioac¬ 
tive decay to be ignored, the amount of erosion 
E (kg m -2 ) at the sampling point associated with the 
period between the collection of the two sets of samples 
can be estimated as: 


where A 0 and A x are the 7 Be inventories (Bq m -2 ) at the 
beginning and end of the study period, respectively; D 
(Bq m -2 ) is the fallout input during the study period; 
and C is the 7 Be content of the eroded soil (Bq kg -1 ). 

The study undertaken by Wilson et al. [35] was 
undertaken on an erosion plot, and in this case, C 
could be established by collecting the eroded sediment 
and measuring its 7 Be content. In that study, C could 
also have been estimated by using the measurements of 
soil loss available from the erosion plot to estimate the 
depth of erosion during the event or study period and 
to then determine C from the depth distribution of 7 Be 
established using sectioned cores. In other applications, 
it is likely to prove more difficult to determine C, which 
is clearly a critical component of Eq. 6. Use of this mass 
balance approach must be seen as rather complex, since 
it involves a set of “before” and “after” measurements, as 
well as measurements of fallout input during the period 
of interest and information on the 7 Be content of eroded 
sediment. As such, it could be seen as providing exper¬ 
imental validation of the potential for using 7 Be to 
document short-term soil redistribution rates, rather 
than a more generally applicable procedure. 

Most other attempts to use 7 Be for documenting 
soil redistribution rates (e.g., [32, 33, 106, 107]) have 
adopted a simpler approach more similar to that 
employed with 137 Cs and 210 Pb ex . This involves com¬ 
parison of the inventories documented at a number of 
sampling points at the end of the period of interest 
using bulk cores, with a reference inventory, also mea¬ 
sured at the end of the period of interest, in order to 
establish the magnitude of the increase or decrease in 
inventory in response to soil redistribution. However, 
the basis of the approach differs in several respects from 
that associated with 137 Cs and 210 Pb ex . Firstly, as indi¬ 
cated above, attention focuses on short periods, usually 
a single storm event or a short period of heavy rainfall 
(e.g., 1-2 weeks). Secondly, study areas are usually 


unvegetated. Where a vegetation cover exists, 
a significant proportion of the 7 Be fallout input to 
a site is likely to be intercepted and will be retained by 
the vegetation cover until it is lost by decay. This situ¬ 
ation differs from that for 137 Cs and 210 Pb ex , where, by 
virtue of their much longer half-lives, the fallout will 
still eventually reach the soil surface after leaf fall or 
decay of the vegetation cover. This interception of the 
incident 7 Be fallout by the vegetation cover is likely to 
cause the fallout input to the soil surface itself to be 
spatially variable and it is not possible to assume 
a spatially uniform reference inventory. Equally, if the 
total inventory contained in both the vegetation can¬ 
opy and the soil is considered, variable partitioning of 
the inventory between the two components makes it 
difficult to link changes in the inventory to soil redis¬ 
tribution. For this reason, the use of 7 Be to study soil 
erosion and soil redistribution rates is generally 
restricted to bare cultivated soils. 

As emphasized by Figs. 6 and 7, a key feature of the 
depth distribution of 7 Be in the soil is its restriction to 
a very shallow layer of the surface soil, generally only ca. 
1-2 cm deep. Within this layer, the depth distribution is 
commonly characterized by a rapid exponential 
decrease with depth. Little difference is found between 
the depth distributions associated with cultivated and 
undisturbed soils, since both are dominated by the 
fallout input to the surface. Where cultivation occurs, 
mixing of the 7 Be into the plow layer will cause the 
activity to fall below the level of detection. The conver¬ 
sion models generally used with 7 Be measurements 
(e.g., [33, 107]) are very similar to the profile distribu¬ 
tion model employed with 137 Cs measurements. The 
7 Be exponential depth distribution at the reference site 
can be characterized by the relaxation depth h 0 kg m -2 
and the mass depth of soil eroded from a sampling 
point R kg m -2 can be estimated by comparing the 
7 Be inventory at the sampling point A Bq m -2 with that 
for the reference site A ref Bq m -2 , and relating the 
difference to h 0 , namely, 

R=ki>n {¥) (7) 

Where the measured 7 Be inventory exceeds the refer¬ 
ence inventory, deposition is assumed to have occurred 
and the mass depth of deposition can be estimated, as 
with 137 Cs (Eq. 4), by dividing the excess 7 Be inventory 
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by the mean 7 Be content of sediment mobilized by 
erosion from the upslope contributing area. 

The procedure commonly used with 7 Be measure¬ 
ments is to visit the study site after a storm event or 
short period of heavy rainfall (e.g., 1 week) and to 
collect cores from a reference site and from the study 
site. The cores collected from the reference site should 
include a sectioned core that is used to determine h 0 . By 
applying Eq. 7, it is possible to estimate the soil redis¬ 
tribution that occurred during the period considered. 
The short half-life of 7 Be and the emphasis on 
documenting individual events do, however, impose 
a number of constraints on the use of 7 Be for estimating 
soil redistribution which do not apply to the use of 
137 Cs and 210 Pb ex to estimate longer-term erosion rates 
covering periods of several decades [14]. Most impor¬ 
tant is the assumption that the spatial pattern of inven¬ 
tories documented by the cores collected after an event 
or period of heavy rainfall reflects soil redistribution 
occurring during that period and that the preexisting 
7 Be inventory can be considered to have been spatially 
uniform across the study area. This means that this 
approach can only be applied to events or periods of 
heavy rainfall separated by relatively long periods with 
little rainfall or at least rainfall or insufficient intensity 
to cause erosion and soil redistribution. A period of ca. 
5 months (i.e., three half-lives) is ideally required to 
ensure that the spatial variability of the 7 Be inventory 
inherited from previous events has effectively 
disappeared as a result of decay. Cultivation of a field 
provides an alternative means of removing or resetting 
the signal from past events, since the existing inventory 
will be mixed into the plow layer and is likely to be 
below the level of detection. This constraint can seri¬ 
ously limit the potential for using 7 Be measurements to 
document short-term erosion rates, since effectively 
only the first period of heavy rainfall following an 
extended dry period or the cultivation of the study 
site can be studied. In many environments, the events 
of most interest may occur later in the wet season or it 
might be important to consider the whole wet season. 
As the length of the period of rainfall studied increases, 
the estimate of erosion or soil redistribution obtained 
will underestimate the true value to a progressively 
greater degree. Walling et al. [87] used a hypothetical 
example to demonstrate that if the period extends to 
6-7 weeks and this includes a number of rainfall events 


of similar magnitude distributed through that period, 
the amount of soil eroded could be underestimated by 
as much as 50%. 

Increasing recognition of the potential value of 7 Be 
in providing a means of documenting short-term ero¬ 
sion rates, to complement the estimates of longer-term 
erosion rates provided by 137 Cs and 210 Pb ex , has 
directed attention to the possibility of extending the 
timescale over which 7 Be measurements can be applied 
and removing the constraint of only being able to 
document the first events occurring after a dry period. 
Such developments will necessarily increase the com¬ 
plexity of the approach and the need for additional 
information on, for example, fallout inputs, changing 
values of /z 0 , and the distribution of erosion within the 
period considered. Walling et al. [87] recently reported 
the development of a novel procedure for applying 7 Be 
measurements, involving such additional information, 
which permitted erosion and soil redistribution to be 
quantified over a period of ca. 3 months. The procedure 
was developed for application in Chile where soil ero¬ 
sion during the period following forest harvesting was 
being studied and there was a need to document erosion 
occurring during the entire wet season. 

Estimating Sedimentation Rates and Establishing 
Chronologies in Depositional Sinks 

The fallout radionuclides 137 Cs and 210 Pb ex have been 
widely used to provide estimates of sedimentation rates 
in depositional sinks, including water bodies such as 
lakes and reservoirs, wetlands and river floodplains and 
for establishing the chronology of sediment cores 
collected from such sinks. In some studies, the empha¬ 
sis may be on documenting sedimentation rates 
(g cm -2 year -1 or mm year -1 ) and thus quantifying 
sediment accretion or sediment storage, whereas in 
others, attention may focus on the chronology. In the 
latter context, there may, for example, be a need to 
establish the chronology of changes in the contaminant 
content of sediment from different depths in a core, in 
order to reconstruct the pollution history of a study site 
and the upstream catchment area. Some attempts have 
been made to use 7 Be for these purposes, but this 
radionuclide has been less widely applied in sedimen¬ 
tation studies, partly because of its short half-life. With 
137 Cs, the radionuclide provides information relating 
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to the period extending from the main period of bomb 
fallout to the present (i.e., ca. 50 years). In areas where 
significant Chernobyl fallout occurred, it may be pos¬ 
sible to separate the periods before and after 1986. With 
210 Pb ex , the timescale extends back ca. 100 years. In 
contrast, where 7 Be is used, it can provide information 
on very recent sedimentation rates or confirm the pres¬ 
ence of recently deposited sediment. Much of the early 
work using 137 Cs and 210 Pb ex for dating sediment cores 
and estimating sedimentation rates focused on lakes, 
and there is now a large body of literature on this topic 
(e.g., [47-51, 108-110]). Less work has been under¬ 
taken on other sediment sinks, such as river flood- 
plains, and these will therefore be emphasized in this 
contribution, in order to demonstrate the potential for 
using fallout radionuclides to study the movement of 
sediment through the landscape. 

Use of 137 Cs Measurements Figure 10 provides an 
example of the depth distributions of 137 Cs and 210 Pb ex 
in cores collected from the overbank deposits on the 
floodplain of two UK rivers, the River Arun and the 
River Tone, and these provide a useful background for 
outlining the various approaches that can be used to 
estimate sedimentation rates and to establish age/depth 
relationships for sediment cores collected from sedi¬ 
ment sinks. The contrasting features of 137 Cs and 
210 Pb ex fallout, and more particularly the event-related 
(i.e., bomb testing) nature of the former and the con¬ 
tinuous nature of the latter, are reflected by the differ¬ 
ent principles involved in the use of the two 
radionuclides. With 137 Cs, attention focuses on the 
known fallout record (see Fig. 2), reflecting the first 
appearance of significant fallout in the early to mid- 
1950s and the occurrence of peak fallout in 1963 and in 
some locations a further short-lived input in 1986, as 
a result of the Chernobyl disaster. The 137 Cs activity of 
the sediment deposited in a lake, floodplain, or other 
sediment sink can be expected to reflect this fallout 
record through two mechanisms. The first mechanism 
represents direct fallout to surface of the sediment sink 
that will be incorporated into the depositing sediment. 
In the case of a lake or reservoir, the 137 Cs fallout to the 
surface of the water body will be dispersed within the 
water and adsorbed by sediment in the water column, 
which is subsequently deposited. With a floodplain or 
marsh, the fallout will be delivered directly to the 


surface of the accreting sediment, where it will “label” 
the surficial sediment. The second mechanism repre¬ 
sents a catchment-derived contribution, reflecting 
137 Cs associated with deposited sediment that was 
mobilized from the catchment surface, which would 
itself have received 137 Cs fallout. As demonstrated in 
Fig. 5, the cumulative inventory of the catchment sur¬ 
face, and thus the 137 Cs content of eroded sediment, 
will also vary through time and follow a similar pattern 
to the direct fallout, peaking around 1963. The precise 
shape of the 137 Cs depth distribution found in a lake or 
on a river floodplain will reflect the relative contribu¬ 
tion of direct fallout and catchment-derived 137 Cs to 
the accumulating sediment [111, 112], but in most 
situations, the sediment deposited in or around 1963 
will possess the highest 137 Cs activity. This provides 
a well-defined time marker. Thus, the clear 137 Cs 
peaks evident at mass depths of ~18 and 15 g cm -2 
for the cores from the Rivers Tone and Arun, respec¬ 
tively, in Fig. 10 can be dated to 1963 and a mean 
sedimentation rate can be calculated by dividing the 
total mass or depth of sediment by the period elapsed 
between 1963 and the date of core collection. Some 
workers have used the depth of the first appearance of 
137 Cs as an alternative time marker, which can be dated 
to ca. 1954. However, this approach is not to be 
recommended, since there is clear evidence that in 
most depositional sinks, the “tail” of the 137 Cs depth 
distribution will be influenced by post-depositional 
downward migration that will cause overestimation of 
the sedimentation rate. As indicated above, in some 
locations influenced by Chernobyl fallout, the existence 
of 137 Cs peaks in a sediment core that can be linked to 
1963 and 1986 can provide a basis for estimating sed¬ 
imentation rates for two periods (i.e., 1963-1986 and 
1986 to present) [113, 114], and evaluating recent 
changes in sedimentation rates associated with changes 
in land use or land management. 

The depth at which the 137 Cs peak is found is 
generally considered to be unaffected by downward 
migration in lake sediment deposits, where mixing is 
commonly assumed to be of limited importance. How¬ 
ever, the possibility of some downward migration must 
be considered when studying floodplain deposits, since 
the peak 137 Cs activity is commonly found just below 
the surface in natural soils. The 137 Cs depth distribu¬ 
tion from a representative undisturbed soil adjacent to 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 10 

Depth distributions of 137 Cs and 210 Pb ex measured in sediment cores collected from the floodplains of the River Arun and 
River Tone, UK 


the floodplain study site can be used to estimate the 
likely downward migration rate of the 137 Cs peak R m 
(kg m -2 year -1 ) by dividing the present depth of that 
peak (expressed as a mass depth) by the time elapsed 
between the year of peak fallout and the present T p 
(years). Thus, if the mass depth at which the peak 137 Cs 
activity occurs in the floodplain core D pk (kg m -2 ) is 
known, and R m has been estimated, the mean sedimen¬ 
tation rate on the floodplain R r (kg m -2 year -1 ) can be 
estimated as: 


It is important to recognize that R\ as estimated 
using Eq. 8 or simply from the depth of the 137 Cs 
peak in a sediment deposit, represents an average 
sedimentation rate and that for some sediment sinks, 
there may be marked inter-annual variability in the 
amount of deposition. On some floodplains, for 
example, most deposition may be associated with 
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infrequent high magnitude floods with recurrence 
intervals of >10 years. 

The above approach to estimating sedimentation 
rates assumes that the sediment core is from an 
undisturbed location and therefore that the 137 Cs 
depth distribution reflects the interaction of the fallout 
input with the progressive sediment accumulation and 
any associated catchment contribution of 137 Cs. In 
some locations, the floodplain surface may be culti¬ 
vated and this will cause mixing of the sediment within 
the plow layer. In this situation, it will no longer be 
possible to use the 137 Cs depth distribution to estimate 
the sedimentation rate. However, several studies [ 115— 
117] have shown that it is still possible to estimate the 
sedimentation rate, if the floodplain has been regularly 
cultivated since the beginning of 137 Cs fallout. If no 
sedimentation has occurred, 137 Cs will only be found to 
a depth equivalent to the plow depth. If, however, 
sedimentation has occurred, its depth can be estimated 
from the depth to which 137 Cs is found below the plow 
depth, due to the progressive accretion of the surface. 
Comparison of the total depth of the 137 Cs profile 
which has been homogenized by plowing D t (kg m -2 ) 
with the plow depth D pl (kg m -2 ) provides a means of 
estimating the sedimentation rate, namely, 


,/ _ A Dy\ 

~ T 
1 v 


( 9 ) 


The need to obtain detailed information on the 
depth distribution of 137 Cs within a sediment core 
can represent an important constraint in the applica¬ 
tion of the approach embodied in Eq. 8, which requires 
information on the depth of the 137 Cs peak. Due to 
likely limitations on the number of core slices that can 
be assayed, due to cost or time constraints, it may not 
be possible to document the small-scale spatial vari¬ 
ability of sedimentation rates. This may be less of 
a problem in lakes, where sedimentation is likely to be 
relative uniform within a small area, but it can be 
a significant problem for river floodplain investiga¬ 
tions, where sedimentation rates can be characterized 
by marked spatial variability, in response to the flood- 
plain microtopography, distance from the main chan¬ 
nel, and the vegetation cover (e.g., [45]). Walling and 
He [118] provide a means of addressing this problem 
by describing a procedure for estimating the sedimen¬ 
tation rate from measurements of the total inventory 


(Bq m -2 ) of bulk cores collected from individual points 
on the floodplain. The bulk cores are assumed to con¬ 
tain the total depth of 137 Cs for the sampling points and 
the key feature of the approach is the estimation of the 
excess inventory, which is defined as the total inventory 
less the direct fallout contribution. The latter can be 
established using cores collected from adjacent refer¬ 
ence site located above the level of flood inundation. 
The excess inventory i ex (Bq m -2 ) represents the 137 Cs 
input associated with deposited sediment, and its mag¬ 
nitude will reflect the rate of deposition and the 137 Cs 
concentration in the deposited sediment. The latter will 
clearly vary through time in response to the accumu¬ 
lation of 137 Cs in the catchment soils and its subsequent 
mobilization by erosion. Walling and He [ 1 18] describe 
a model that can be used to estimate the time-weighted 
mean concentration of the deposited sediment and 
thus the deposition rate. However, they also propose 
a simpler and more direct approach for estimating the 
deposition rate for individual sampling points, based 
on measurements of the inventories of bulk cores. This 
makes the assumption that the temporal variation of 
the 137 Cs content of the deposited sediment would have 
been similar across the floodplain surface and that the 
main factor accounting for spatial variation in the 137 Cs 
content of deposited sediment will be its grain-size 
composition. If the sedimentation rate R' 0 (kg m -2 
year -1 ) and excess 137 Cs inventory / eXj0 are known for 
a specific representative point on the floodplain for 
which a well-defined 137 Cs profile obtained from 
a sectioned core is available, the sedimentation rate R' 
at other points on the floodplain can be estimated from 
a comparison of the values of excess inventory i ex and 
the grain-size composition of the sediment at those 
points with the equivalent information for the refer¬ 
ence point, that is, 


R' = K 


/ A 


Ax,0 



( 10 ) 


where 5^,0 and S& represent the specific surface area of 
surface sediment (m 2 g -1 ) from the reference point and 
the sampling point, respectively, and v = a constant 
reflecting the form of the relationship between 137 Cs 
concentration and specific surface area, which has been 
shown to have a value of ca. 0.65-0.7 [84]. 

It is important that the second term in Eq. 10 
should be included, in order to take account of the 
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influence of grain-size composition on 137 Cs activity 
and the preferential association of 137 Cs with finer 
sediment. In Eq. 10, the assumption is made that the 
grain-size composition of surface sediment is represen¬ 
tative of that for the entire depth profile. The approach 
involving the use of measurements of the inventory of 
bulk cores represented by Eq. 10 can also be used for 
cultivated floodplains, although in this case, will be 
estimated using Eq. 9 rather than Eq. 8. The use of bulk 
core measurements in the estimation of floodplain 
accretion rates depends heavily on the assumption 
that the deposited sediment is characterized by a sig¬ 
nificant 137 Cs activity, which in turn results in readily 
measurable excess inventories. In some environments, 
the deposited sediment may contain low levels of 137 Cs 
or no 137 Cs, because of the nature of the sediment 
sources in the upstream catchment or the low invento¬ 
ries existing in the catchment. In this situation, it may 
be difficult to quantify the limited excess inventory due 
to the low precision of the gamma spectrometer mea¬ 
surements and the 137 Cs depth distribution may repre¬ 
sent the only viable approach to quantify the 
deposition rate. A situation of this nature has been 
described for the floodplain of a river in central 
Queensland Australia [119]. 

Use of 210 Pb ex Measurements Because of the essen¬ 
tially continuous nature of 210 Pb ex fallout, the 210 Pb ex 
depth distributions associated with sediment sinks do 
not provide a distinct time marker, as provided by 
137 Cs. As with 137 Cs, the 210 Pb ex content of the accreting 
sediment will reflect both direct fallout to the lake or 
floodplain surface and 210 Pb ex associated with sedi¬ 
ment mobilized from the upstream catchment. If it is 
assumed that the sediment surface is progressively bur¬ 
ied by deposition, the 210 Pb ex activity of the sediment in 
a core will decline with depth due to radioactive decay. 
The rate of decline in the 210 Pb ex activity with depth 
provides a means of estimating the age of the sediment 
at different depths and thus the sedimentation rate. 
The 210 Pb ex depth distributions for the floodplain of 
the Rivers Tone and Arun shown in Fig. 10, for exam¬ 
ple, demonstrate that the 210 Pb ex activity declines to 
zero at a mass depth of about 45-55 kg m -2 . Since it is 
normally assumed that 210 Pb ex activity will decline to 
near-zero after ca. five half-lives, the sediment at this 
depth can be estimated to be about 110 years old. 


A preliminary estimate of the sediment age at different 
depths can be obtained by assuming that the 210 Pb ex 
activity in the deposited sediment declines exponen¬ 
tially with time in accordance with the radioactive 
decay law, namely, 

C d = C 0 e~ u (11) 

where C d is the activity at a given time, C 0 is the initial 
activity of the deposited sediment, X is the decay con¬ 
stant for 210 Pb (0.03114 year -1 ), and t is the number of 
years since burial. 

Use of Eq. 11 to estimate sediment age assumes that 
the activity of the sediment at the time of its initial 
deposition is constant through time and thus that the 
activity is not influenced by the sedimentation rate. In 
the real world, these assumptions may not apply and 
a range of different models of varying complexity have 
been developed for interpreting the 210 Pb ex depth dis¬ 
tributions in sediment cores, in order to obtain reliable 
estimates of sediment age or sedimentation rates. In 
these models, depths are commonly expressed as mass 
depths (g cm -2 ) to avoid complications due to sedi¬ 
ment compaction and downcore variations in bulk 
density. The models involve a number of different 
assumptions. In the simplest model, often referred to 
as the Constant Flux, Constant Sedimentation rate 
(CFCS) model, a constant initial concentration for 
the deposited sediment C 0 Bq kg -1 and a constant 
dry mass sedimentation rate r g cm -2 year -1 , and 
therefore a constant depositional flux are assumed, 
resulting in the following relationship: 

C m = C 0 e~^ r (12) 

where C 0 and C m are the 210 Pb ex concentrations at the 
surface of the core and at mass depth m, respectively. 

When log C m is plotted against m, the resulting plot 
will be linear and the mean sedimentation rate r can be 
estimated from the slope of the best-fit line. 

In many situations, however, it is unrealistic to 
assume a constant sedimentation rate, particularly in 
environments where extreme events play an important 
role in the operation of erosion and sedimentation 
processes. Equally, and perhaps more importantly, 
human impact on erosion rates as a result of land 
clearance and land use change will mean that sedimen¬ 
tation rates cannot be treated as stationary, but must be 
seen as being likely to have changed significantly over 
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the past 100-150 years covered by the 210 Pb ex measure¬ 
ments. Two models, generally referred to as the Con¬ 
stant Rate of Supply (CRS) and Constant Initial 
Concentration (CIC) models, have been developed to 
take account of variable sedimentation rates [49, 120, 
121]. In the first model, the annual supply of 210 Pb ex to 
the accreting sediment is assumed to be constant, 
meaning that the activity of the sediment deposited in 
a given year will vary according to the amount of 
sediment deposited. If this is high, the activity will be 
reduced. With the CIC model, the initial activity of the 
deposited sediment is assumed to remain constant 
through time. Neither assumption is necessarily 
a close fit to reality. In this context, it is important to 
recognize the two sources of supply of 210 Pb ex to the 
accreting sediment, namely, direct fallout to the lake 
or floodplain surface and supply of 210 Pb ex in associa¬ 
tion with the eroded sediment. Temporal variation 
of the magnitude of the latter in response to changes 
in erosion rates and sediment flux will breach the 
assumption of the CRS model. Equally, variation of 
the mass of sediment deposited will mean that the 
initial 210 Pb ex concentration associated with that sedi¬ 
ment will vary through time, since the direct fallout 
input will be mixed with a variable mass of sediment. 
Furthermore, the initial 210 Pb ex activity of the sedi¬ 
ment mobilized from the upstream catchment can be 
expected to vary as sediment sources change as a result 
of land clearance and land use change within the catch¬ 
ment. If, for example, gully erosion assumes increasing 
importance, the 210 Pb ex activity of the sediment mobi¬ 
lized from the upstream catchment might be expected 
to decline. 

Appleby [109] provides an explanation of the pro¬ 
cedures involved in applying the CRS model which 
permits the age t of the sediment at a given depth m 
to be calculated as: 



where A 0 and A m are the total inventories below the 
surface and below the given depth m, respectively. 

The sedimentation rate r g cm -2 year -1 at time t 
can also be calculated as: 


where C m is the 210 Pb ex concentration at depth m and 
time t and A m is the total inventory below this depth. 

With the CIC model, the initial 210 Pb ex concentra¬ 
tion of the sediment at depth m and of age t is assumed 
to have been the same as the current concentration of 
sediment at the sediment-water interface or floodplain 
surface C 0 . If the present concentration C of the sedi¬ 
ment at this depth is known, its age t can be calculated 
using the relationship defined in Eq. 11, that is, 

c = C 0 e~ Xt 

The CIC model implicitly assumes that the supply 
rate of 210 Pb ex is proportional to the sedimentation rate 
and this situation will not generally apply. For this 
reason, the CRS model is often assumed to provide 
more meaningful estimates of the sedimentation rate 
and, in the case of lakes, this has been confirmed by the 
similarity of CRS model-derived chronologies to the 
137 Cs chronology and independent measures of sedi¬ 
ment age based on counting of annual laminae. To date, 
little attention has been given to validating the models 
when applied to sediment cores from river floodplains. 

As with the interpretation of 137 Cs depth distribu¬ 
tions or profiles, use of the above models to estimate 
sedimentation rates or sediment age requires detailed 
information on the 210 Pb ex depth distribution and this 
may introduce constraints on the number of cores that 
can be studied. Such limitations may be of limited 
importance in lakes, where sedimentation rates are 
unlikely to vary markedly over short distances, but for 
river floodplains and other terrestrial sediment sinks, 
such as wetlands, much greater spatial variability may 
occur and there may be a need to document this. Faced 
with the need to document the detailed patterns of 
sediment deposition on river floodplains, He and 
Walling [122] described a model which could be 
applied to bulk cores for which a measurement of the 
total 210 Pb ex inventory was available. In this case, the 
complexity introduced by the dual source of 210 Pb ex , 
namely, direct fallout to the floodplain surface and 
210 Pb ex associated with the sediment mobilized from 
the upstream catchment, was reduced by focusing only 
on the latter component. This was calculated by 
subtracting the local reference inventory A ref from the 
total inventory A tota i, since the former represents solely 
the fallout input. The model assumed a constant initial 
concentration for the deposited sediment C r and 
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a constant sedimentation rate r (averaged over a period 
of ca. 100 years) and was termed the CICCS model. The 
mean sedimentation rate for the past ca. 100 years (i.e., 
ca. five half-lives) can be estimated as: 

_ n ^total ^ref /-. 


C r is a key parameter in Eq. 15 and its estimation 
may prove difficult. It can be determined empirically by 
using sedimentation traps (e.g., [123]) to collect sam¬ 
ples of the sediment deposited during overbank flood 
events and measuring its 210 Pb ex content. Equally, it 
can be estimated from measurements of the 210 Pb ex 
activity of suspended sediment transported during 
flood events, although potential contrasts between the 
210 Pb ex activity of suspended sediment and deposited 
sediment associated with contrasts in grain-size com¬ 
position should be taken into account. In the absence 
of such direct measurements, C r can also be estimated 
from measurements of the 210 Pb ex activity of surface 
sediment from the floodplain and subtracting 
the direct fallout contribution to that concentration. 
Figure 11 presents the age-depth relationships derived 
from the 137 Cs and 210 Pb ex depth distributions 
presented in Fig. 10 using Eq. 8 for 137 Cs and the 
CICC model for 210 Pb ex . There is good agreement 


between the estimates provided by the two radionu¬ 
clides, although the results suggest that deposition 
rates in more recent years have increased for the 
River Tone and decreased for the River Arun. Such 
changes could reflect a range of factors, including 
changing sediment loads and changes in the frequency 
and depth of overbank flooding related to either 
changes in flood generation in the catchment or the 
effects of progressive floodplain accretion in increas¬ 
ing the bankfull stage. The latter would give rise to 
a reduction in the frequency and depth of overbank 
inundation. 

In addition to the relatively simple conceptual 
models for deriving estimates of sedimentation rates 
within sediment sinks from 210 Pb ex depth distribu¬ 
tions, described above, more complex inductive 
models that permit the interpretation of 210 Pb ex pro¬ 
files in situations where both sedimentation rates and 
210 Pb ex fluxes vary with time have been developed 
[51, 124]. In this case, signal theory is used to compare 
the actual 210 Pb ex profile with model profiles produced 
using realistic values of the various parameters and, by 
minimizing an objective function, it is possible to 
deduce the model form and the sedimentation rate. 
The Sediment Isotope Tomography (SITS) model 
described by Carroll and Lerche [51] is probably the 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 11 

A comparison of the age-depth relationships established for the floodplain cores illustrated in Fig. 10, using the 137 Cs 
depth distribution and the 210 Pb ex depth distribution coupled with the CICC model 
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most widely applied model of this type, and some 
success has been achieved in incorporating bioturba- 
tion and mixing into the model structure. 

For most sediment sinks, it is reasonable to assume 
quasi-continuous sedimentation, even though sedi¬ 
mentation rates may vary from year to year and over 
longer periods. This is the situation on many river 
floodplains where the river inundates the floodplain 
several times a year and accretion is ongoing. In some 
situations, however, sedimentation may be episodic 
and this introduces complications for interpreting the 
sediment profile, as this may reflect episodic accumu¬ 
lation of sediment, separated by periods of several 
years, or even a decade, with no sedimentation. 
A situation of this nature has been described for the 
floodplains of the Beni and Mamore Rivers in the 
Andean-Amazonian Foreland [125], but it still proved 
possible to use 210 Pb ex measurements to reconstruct the 
sedimentation history of the floodplain over the past 100 
years and to demonstrate the importance of extreme 
ENSO (El Nino/La Nina) events within this history. 
Eleven major events in the past 90 years were identified. 
In this case, a model referred to as the CIRCAUS (con¬ 
stant initial reach clay activity, unknown sedimentation) 
model was used. When applying this model, the 210 Pb ex 
activity is normalized to reflect the clay (<4 pm) con¬ 
tent of the sample, since most of that activity was 
shown to be associated with the clay fraction. The 
same CIRCAUS model was successfully used to study 
the chronology of floodplain sedimentation for the 
Strickland River in Papua, New Guinea [126]. 

Using 7 Be Measurements In comparison with 137 Cs 
and 210 Pb ex , relatively little use has been made of 7 Be to 
date for investigating sedimentation rates within sedi¬ 
ment sinks in fluvial systems. Its use for this purpose 
has, however, been more fully exploited in coastal 
environments. Because of the short half-life of 7 Be 
(53 days), it can provide an effective means of identi¬ 
fying recently deposited sediment. Older sediment can 
be assumed to contain no 7 Be. This approach has been 
successfully used in shallow marine environments [127, 
128] and a similar approach was employed in a study of 
the deposition and resuspension of PCB-rich sedi¬ 
ments in the Fox River, Wisconsin, USA [129]. That 
study emphasized that appreciable short-term deposi¬ 
tion and remobilization occurred, with short-term 


accumulation rates greatly exceeding the longer-term 
rates documented using 137 Cs measurements. 

Although it is normally assumed that fallout radio¬ 
nuclides such as 7 Be will be preferentially associated 
with fine sediment, and particularly the clay fraction, 
recent work has demonstrated how coarser sand-sized 
(63 pm-2 mm) fluvial sediment can also retain 7 Be and 
that the radionuclide can be used to provide informa¬ 
tion on the residence time of such sediment in channel 
bars in rivers and on the sedimentation rates associated 
with such bars [130]. In a study of the Ducktrap River 
in Maine, USA, the evolution of channel bars associ¬ 
ated with obstructions caused by large woody debris 
and the associated bar accumulation rates were inves¬ 
tigated using 7 Be measurements and the application of 
a variant of the CIC model to interpret the depth 
distribution of 7 Be in the channel bars. 

7 Be measurements can also be used in a similar 
manner to 137 Cs and 210 Pb ex to estimate floodplain 
sedimentation rates. However, because of its short 
half-life, attention focuses on the sedimentation asso¬ 
ciated with individual events, rather than longer-term 
average rates of accretion. If shallow cores are collected 
from a floodplain after a flood event, the 7 Be invento¬ 
ries of those cores will reflect both the recent fallout to 
the floodplain surface and 7 Be associated with the sed¬ 
iment mobilized from the upstream catchment and 
deposited on the floodplain. If the local reference 
inventory for the time of sampling A re f Bq m -2 is 
established, by collecting shallow cores from an adja¬ 
cent reference site above the level of the inundating 
floodwaters, subtraction of this value from the inven¬ 
tories measured for the individual sampling points 
enables the excess inventory A ex Bq m -2 associated 
with the deposited sediment to be calculated. The mag¬ 
nitude of this value will directly reflect the amount of 
sediment deposited and if the 7 Be concentration of the 
newly deposited sediment Q Bq kg -1 is known, the 
amount of sediment deposited D f kg m -2 can be esti¬ 
mated, namely, 

D,=^ (16) 

Q is an important parameter in Eq. 16 and can 
prove difficult to establish. It could be measured 
directly by collecting samples of the deposited sediment 
using a sedimentation trap, or the 7 Be activity of the 
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suspended sediment transported by the river sampled 
during the flood event could be used as a surrogate. In 
both cases, however, the effects of the particle size 
composition of the sediment in influencing its 7 Be 
concentration must be considered. If the first approach 
is used, it is unlikely to prove feasible to install sedi¬ 
ment traps adjacent to each sampling point and only 
a single sample or a composite sample comprising 
sediment collected from several traps may be available. 
In this situation, the 7 Be activity of the sample of 
deposited sediment C t must be adjusted to provide an 
estimate of C d for each sampling point by taking 
account of any difference in particle size between the 
sediment deposited at the sampling point and the sed¬ 
iment collected using the sediment trap. As with Eq. 10, 
which applied a similar approach for correcting 137 Cs 
activity to reflect the variation of the grain-size com¬ 
position of sediment between individual sampling 
points, the measure of specific surface area (m 2 g -1 ) 
provides a convenient means of comparing the particle 
size composition of samples. If surface samples are 
collected adjacent to each sampling point and the spe¬ 
cific surface area of those samples S d and the sample 
collected by the sediment trap S t is measured, C d for 
individual sampling points can be estimated as: 


Q = Q 



(17) 


where v = a constant reflecting the form of the rela¬ 
tionship between 7 Be concentration and specific sur¬ 
face area with a value of ca. 0.64 [53] . If the estimate of 
C d is based on measurement of the 7 Be activity of 
suspended sediment, the values of C t and S t in Eq. 17 
must be replaced by the values for the suspended 
sediment. 

The procedure outlined above has been successfully 
used to document the spatial variability of sediment 
deposition during a single overbank flood event within 
a reach of the River Culm floodplain in Devon, UK 
[53]. When applying this approach, it important to 
ensure that the excess inventory inherited from previ¬ 
ous events is minimal and this means that the overbank 
flood studied should be preceded by a period of ca. 
5 months with no inundation of the floodplain. 
To overcome this constraint, it might be possible to 
collect cores from the floodplain both before and after 
the event, but this may introduce problems in terms of 


forecasting the occurrence of the flood that is to be 
studied and collecting samples from closely adjacent 
points, so that the pre- and post-flood 7 Be inventories 
can be directly compared. Since most floodplain sur¬ 
faces are vegetated, the presence of grass and other 
vegetation can also introduce problems, but this is 
less significant than for studies of soil redistribution 
within fields. The shallow cores should include the 
vegetation mat, and because the amount of sedimenta¬ 
tion is calculated from the excess inventory, no 
assumption is made as to whether the deposited sedi¬ 
ment is held within the vegetation mat or if it is on the 
floodplain surface. 

As indicated above, most work on using fallout 
radionuclides to estimate sedimentation rates and sed¬ 
iment chronologies for sediment sinks has focused on 
lakes and reservoirs, which commonly represent sim¬ 
pler systems than other sinks, such as river floodplains 
and wetlands. For example, it is generally assumed that 
the fallout input to a lake surface will be mixed within 
the body of the lake and will be attached to depositing 
sediment particles, which may already contain the same 
radionuclide by virtue of having been mobilized from 
the upstream catchment. As such, the two components 
of the total activity of the deposited sediment associ¬ 
ated with a given year will be closely integrated and 
contained within the sediment deposit for that year. In 
the case of a river floodplain, however, the fallout to the 
floodplain surface will be deposited on the sediment 
surface over the course of the year and will be distrib¬ 
uted within the shallow surface layer according to its 
initial distribution. If overbank sedimentation occurs 
during a flood, this will be deposited on top of the 
fallout input and the two may not be integrated. Fur¬ 
thermore, depending on the relative magnitude of the 
deposition rate and the relaxation depth of the initial 
distribution, the fallout input may be distributed 
through the sediment deposited over several years, or 
alternatively, it may only penetrate the upper part of 
the sediment deposited in a given year. Equally, there is 
likely to be greater temporal variability in the gain size 
composition of sediment deposited on a river flood- 
plain, according to the magnitude of the flood events 
and this may influence the depth distribution of the 
fallout radionuclide, with coarser horizons being 
marked by lower activity. As noted above, Aalto and 
his coworkers have overcome this problem in their 
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studies of the floodplains of large rivers, such as the 
Amazon and the Strickland River in Papua New Guinea 
[125, 126], by normalizing the 210 Pb ex measurements 
for the clay (<4 pm) content of the individual sedi¬ 
ment sections recovered from the cores. 

Tracing Suspended Sediment Sources and Sediment 
Transfer 

In addition to their use for estimating soil redistribu¬ 
tion rates and rates of sedimentation in sediment sinks, 
such as lakes and river floodplains, fallout radionu¬ 
clides also provide important opportunities for tracing 
suspended sediment sources and for developing an 
improved understanding of sediment transfers in 
catchments and river basins. Concern for the off-site 
impacts of soil erosion in providing increased inputs of 
fine sediment to watercourses and for the many adverse 
impacts of increased fine sediment loads in river sys¬ 
tems has focused attention on the need for sediment 
management or control strategies aimed at reducing 
sediment mobilization and transport. A range of con¬ 
trol measures have been developed and successfully 
employed to reduce fine sediment fluxes, but the design 
of effective control strategies requires that control mea¬ 
sures should target the key sediment sources and trans¬ 
fer pathways, in order to optimize the return on 
expenditure. Expenditure on reducing soil loss from 
cultivated areas could, for example, bring limited 
return if much of the sediment transported by 
a stream is mobilized by channel erosion. For this 
reason, an assessment of the primary suspended sedi¬ 
ment sources in a catchment is often seen as an essential 
precursor to designing and implementing a sediment 
control strategy [131]. 

Sediment source fingerprinting techniques are 
increasingly used to provide information on sediment 
sources [132]. In essence, these techniques involve 
collecting samples of the fine sediment transported by 
a stream or river or deposited on its bed and comparing 
the chemical and physical properties of this sediment 
with those of potential sources. Such comparisons can 
be used to identify the primary source of the sediment 
and if a numerical mixing model is used (e.g., [131— 
135]), it is possible to quantify the relative contribu¬ 
tion of a range of potential sources. In this context, 
source can represent spatial sources (e.g., different 


subcatchments or zones within a catchment), but 
more often emphasis will be placed on what can be 
referred to as source types , which reflect the nature of 
the erosion process or sediment source, for example, 
rill and interrill erosion on cultivated fields, and on 
land occupied by pasture or rangeland, gully erosion, 
erosion of tracks and unmetalled roads, and bank and 
channel erosion. 

The success of sediment source fingerprinting stud¬ 
ies depends heavily on the availability of tracer or 
fingerprint properties that are able to discriminate 
between potential sources. If the aim is to assess the 
relative importance of a number of different sources, 
several fingerprint properties will commonly be used. 
A wide range of sediment properties have been success¬ 
fully used as source fingerprints, and these include 
inorganic and organic geochemistry, mineral magne¬ 
tism, sediment color, stable isotopes and radionuclides. 
Fallout radionuclides, and particularly 137 Cs, 7 Be, and 
210 Pb ex , have proved particularly useful as source fin¬ 
gerprints, since their fallout origin makes them essen¬ 
tially independent of the underlying geology or soil 
type and underpins their successful use for discrimi¬ 
nating between different potential sediment source 
types [135]. In this context, the surface of 
a catchment will be exposed to direct fallout and the 
activity found in surface soils will be substantially 
greater than that associated with sediment exposed in 
channel banks and ditches or in gullies. Equally, where 
the soil is cultivated, the activity of 137 Cs and 210 Pb ex in 
the surface soil can be expected to be significantly lower 
than in an undisturbed soil, such as might be found in 
an area of pasture or rangeland. Furthermore, the dif¬ 
ferent nature and behavior of 7 Be fallout can provide 
potential for further discrimination, since this will only 
label the upper 1 or 2 cm of a cultivated soil, whereas 
137 Cs and 210 Pb ex will be mixed throughout the plow 
layer. This can provide a means of distinguishing sed¬ 
iment mobilized by sheet or interrill erosion and rill 
erosion, since the surface layer mobilized by sheet or 
interrill erosion will be characterized by high 7 Be activ¬ 
ity, whereas the 7 Be activity of sediment mobilized by 
rill erosion will be lower, as it includes sediment mobi¬ 
lized from a greater depth within the plow layer. 

Figure 12 provides a further demonstration of the 
ability of fallout radionuclides, in this case 137 Cs and 
7 Be, to discriminate between potential sediment 
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Fallout Radionuclides and the Study of Erosion and 
Sedimentation. Figure 12 

The use of 7 Be and 137 Cs activity to discriminate between 
five potential suspended sediment sources in 
a small 1 km 2 catchment in Devon, UK 


sources in a small (ca. 1 km 2 ) catchment in Devon, UK. 
By considering both the 137 Cs and 7 Be activities, it is 
possible to distinguish sediment mobilized from five 
potential sources. Sediment mobilized by sheet erosion 
from cultivated and pasture fields is characterized by 
a high 7 Be activity, by virtue of its surface origin, but 
the cultivated and pasture sources are clearly distin¬ 
guished by their 137 Cs activity, which is much lower in 
the cultivated soil, due to the homogenization of the 
plow layer by tillage. Where rill erosion is the dominant 
erosion process on cultivated land, the 137 Cs activity of 
the mobilized sediment is close to that associated with 
sediment mobilized from cultivated land by interrill 
erosion, but the 7 Be activity is much lower, due to the 
inclusion of soil from below the surface containing 
little or no 7 Be. In contrast, channel banks are charac¬ 
terized by both low 137 Cs and 7 Be activities, because of 
the limited exposure of the face to fallout and the 
presence of both radionuclides in only the upper part 
of the bank profile. Eroding trackways are also distin¬ 
guished by their radionuclide fingerprint. As erosion 
proceeds, the existing 137 Cs inventory is removed and 
the eroded surface material is characterized by a low 
137 Cs activity. However, the 7 Be activity of sediment 
from this source is seen to be intermediate between that 
associated with channel banks and sediment mobilized 


from the surface of cultivated and pasture fields by 
sheet erosion. Because the 7 Be content of surface mate¬ 
rial is continually replenished by fresh fallout, the sur¬ 
face of eroding trackways can be characterized by an 
appreciable 7 Be activity. The sediment mobilized from 
the surface of eroding trackways is, however, likely to 
have a higher activity than that mobilized by rill ero¬ 
sion from cultivated fields, since the rills will penetrate 
deeper and mobilize some sediment with a low 7 Be 
activity. The addition of 210 Pb ex to the results presented 
in Fig. 12 is likely to demonstrate further contrasts 
between the potential sources, since the ongoing nature 
of 210 Pb ex fallout will provide some replenishment of 
the stock of this radionuclide in the surface horizons of 
eroding areas, whereas the effective cessation of 137 Cs 
fallout in the late 1970s means that no replenishment 
has occurred since that time. 

The discrimination between various potential sed¬ 
iment sources provided by 7 Be and 137 Cs activity that is 
demonstrated in Fig. 12 means that these two fallout 
radionuclides are valuable fingerprint properties and 
the potential to distinguish sources could be increased 
further by looking also at 210 Pb ex activity. However, 
additional fingerprint properties are likely to be 
required if a mixing model is used to provide definitive 
estimates of the relative contribution of the potential 
sources to the sediment sampled at the outlet of 
a catchment, in order to avoid problems of equifinality. 
Statistical procedures, such as multiple discriminant 
function analysis [131, 136], can be used to identify 
the optimum set of fingerprint properties or composite 
fingerprint, as it is often termed, for discriminating 
between all of the potential sources. There is also 
a need to recognize that contrasts in particle size com¬ 
position and organic content between the sediment 
sampled at the catchment outlet or in the river, and 
the source material can mean that it is not possible to 
compare sediment and source material properties 
directly, and that some correction of the property 
values for particle size and organic matter content 
may be required [136]. Recent work has recognized 
the many uncertainties associated with characterizing 
the fingerprints of potential sources, and Monte Carlo 
techniques and Bayesian statistics have been used to 
explicitly recognize this uncertainty within the mixing 
models used to establish the relative contribution of 
different sources [137, 138]. 
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The use of fallout radionuclides as sediment source 
fingerprints, both alone and with other sediment prop¬ 
erties, has been successfully demonstrated at a range of 
scales. At the plot scale, for example, the contrasting 
depth distributions of 7 Be, 137 Cs, and 210 Pb ex have been 
shown to provide a means of establishing the relative 
contribution of sheet and rill erosion to the sediment 
output from small erosion plots under rainfall simula¬ 
tors and documenting the temporal variation of the 
sediment contributions from the two sources [139, 
140]. At the larger field scale, Whiting and coworkers 
[141] investigated the erosion processes operating on 
a 6.03 ha cultivated field near Treynor, Iowa, USA, and 
used information on the different depth distributions 
associated with 7 Be, 137 Cs, and 210 Pb ex and measure¬ 
ments of the radionuclide content of the sediment 
output from the field during a storm event in mass 
balance calculations to estimate the areal extent of rill 
and sheet erosion and the characteristic depths of ero¬ 
sion associated with each process. Rill erosion was 
found to produce 29 times more sediment than sheet 
erosion, with rills eroding 0.38% of the field to a depth 
of 35 mm and sheet wash eroding 37% of the field to 
a depth of 0.012 mm. Within larger catchments, He and 
Owens [142] describe the use of 137 Cs, 210 Pb ex , and 
226 Ra activities to establish radionuclide fingerprints 
for cultivated land, uncultivated land, and channel 
banks within the 273 km 2 catchment of the River 
Culm in Devon UK and the application of these finger¬ 
prints to estimate the relative contribution of the three 
sources to the sediment output from the catchment. 
Similarly, Whiting et al. [143] working within the Yel¬ 
lowstone River catchment in Montana, USA, used mea¬ 
surements of 7 Be and 210 Pb ex to discriminate two 
primary sediment sources, namely, the catchment sur¬ 
face and eroding channel banks, and to estimate the 
changing relative importance of these sources along 
a 423 km 2 reach of the river. 

In addition to their use as source fingerprints, fall¬ 
out radionuclides have also been used as tracers to 
provide an improved understanding of the movement 
or transfer of sediment through river systems. In the 
study undertaken in the Yellowstone River in Montana 
[143] referred to above, radionuclide activities in the 
suspended sediment sampled at different locations 
along the river were used to investigate the sediment 
transport dynamics. Fine sediment transport distances 


were estimated from the exponential decrease in 137 Cs, 
210 Pb ex , and 7 Be concentrations downstream, and the 
transport distance was found to increase from a few km 
in the headwaters to hundreds of km downstream. 
Measurements of the 137 Cs and 7 Be activities of 
suspended sediment and fine bed sediment collected 
within the catchment of the River Clyst in Devon, 
UK were also used by Blake and coworkers [53] to 
explore the importance of short-term storage and 
remobilization of sediment within the channel system 
to downstream sediment delivery. The ratio of 7 Be to 
210 Pb ex has also been successfully employed to assess 
the importance of recently resuspended bottom sedi¬ 
ment to the suspended sediment moving within the 
Savannah Estuary, USA [144]. In this estuary, 7 Be 
activities were very low in the bottom sediment, when 
compared to those associated with recent suspended 
sediment inputs, while there was little difference in 
210 Pb ex activity between the two sediment types. 

As noted above, 7 Be has been found to provide 
a useful chronometer for coarser sand-sized fluvial 
sediment (transitional bedload) deposited in channel 
bars [130] and the labeling of this coarser sediment 
with 7 Be has also been exploited to document bedload 
transport velocities and the residence time of bed 
deposits [145]. In that study undertaken in Vermont, 
USA, the ratio of 7 Be to total 210 Pb was used in prefer¬ 
ence to 7 Be alone, since use of the ratio takes account 
of variations in grain size between samples. When 
sediment is deposited and stored on the riverbed, its 
7 Be activity declines through time through decay, 
whereas the 210 Pb activity remains essentially constant. 
The magnitude of the ratio can therefore provide infor¬ 
mation on the short-term history of sediment deposi¬ 
tion, remobilization, and replenishment at a site. The 
existence of a flood control dam on a river also pro¬ 
vided the opportunity to investigate bedload transport 
velocities. Closure of the dam gate in early December 
caused the trapping of bedload and when the dam 
gate was opened in late March, the bedload which 
had been stored for several months and therefore had 
a near-zero 7 Be activity was released. By studying the 
downstream migration of this “dead” sediment, it was 
possible to document bedload transport velocities and 
to establish the length of the reach over which the 
influence of the dam in controlling bedload transport 
was evident. 
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Field and Laboratory Techniques 

A detailed description and discussion of the field 
and laboratory techniques and procedures associated 
with the use of fallout radionuclides in the study of 
erosion and sedimentation falls outside the scope of 
this contribution. The reader is referred to the Hand¬ 
book for the Assessment of Soil Erosion and Sedimenta¬ 
tion Using Environmental Radionuclides edited by 
F. Zapata and published by Kluwer Academic Publishers 
in 2002 [29] for a detailed discussion of most of the 
field techniques and laboratory procedures employed 
when using 137 Cs measurements. Many of these are 
common to 7 Be and 210 Pb ex . Some further description 
and discussion of techniques will also be included in 
the case studies or examples presented at the end of this 
contribution. However, a brief overview of some key 
features of the field and laboratory techniques 
employed and important issues that frequently need 
to be addressed is provided below. This deals firstly 
with field techniques for sample collection, and sec¬ 
ondly with laboratory analysis of the samples collected. 

Field Techniques for Sample Collection 

In most cases, the application of fallout radionuclides 
in soil erosion and sedimentation investigations 
involves the collection of samples from the field, for 
example, soil cores for use in studies of soil redistribu¬ 
tion, cores from lakes or river floodplains for 
documenting sedimentation rates or establishing the 
chronology of sediment deposits, or samples from 
potential sources and samples of suspended sediment 
for use in sediment source fingerprinting investiga¬ 
tions. As with most environmental sampling, there is 
an important need to ensure that the samples are rep¬ 
resentative and that they will provide results that can be 
readily interpreted. A carefully designed sampling pro¬ 
gram is therefore an essential precursor to the program 
of laboratory analysis. The need for representative sam¬ 
ples involves both a temporal and a spatial dimension 
and both dimensions are important. When collecting 
samples of suspended sediment for establishing 
suspended sediment sources, for example, the temporal 
dimension can be very important and it is necessary to 
recognize that the properties of the sediment passing 
a sampling point may vary considerably through time, 
in response to different stages of the hydrograph and 


the routing of sediment from different parts of the 
catchment which may contribute at different times. It 
will therefore often be necessary to collect time- 
integrated samples, in order to provide representative 
information on the source of the sediment. Time inte¬ 
grating suspended sediment samplers or traps [146] 
provide one means of collecting such samples. These 
traps also have the advantage that they collect relatively 
large samples. Sizeable amounts of sediment may be 
required for subsequent laboratory analysis and if 
instantaneous samples are collected, a continuous 
flow centrifuge may be required to recover the sedi¬ 
ment from large water samples [147]. 

When sampling fields to investigate patterns and 
rates of soil redistribution or collecting cores from lakes 
or river floodplains to document sedimentation rates, 
the temporal dimension will generally be of very lim¬ 
ited importance for studies employing 137 Cs and 
210 Pb ex > since these radionuclides provide informat¬ 
ion on medium- or longer-term process rates (i.e., 
50-100 years) and their inventories are unlikely to be 
influenced by the conditions immediately preceding 
the time of sampling. However, when 7 Be is being 
used, the timing of sampling can be a key consideration 
in terms of providing definitive results, since there is 
need to avoid the possibility of past events influencing 
the measurements obtained. 

Important issues related to the spatial dimension 
extend across the sampling associated with most appli¬ 
cations of fallout radionuclides in the study of erosion 
and sedimentation. A key consideration here is that the 
cores, which frequently represent the main means of 
sample collection in studies of soil redistribution or 
sedimentation, are generally of relatively small diame¬ 
ter and are therefore characterized by a small or, 
perhaps more correctly, a very small, surface area. Sam¬ 
pling points must be carefully selected and the sam¬ 
pling scheme must be carefully designed to obtain 
representative data. When studying soil redistribution 
within fields, a density of 20 cores ha -1 is likely to be 
considered fairly dense, but such cores are typically ca. 
8 cm in diameter and the cores will therefore typically 
only cover <0.001% of the surface area under investi¬ 
gation. Transects and sampling grids are frequently 
employed in studies of soil redistribution or sedimen¬ 
tation patterns and rates to provide representative data, 
but it is important to also recognize the uncertainty 
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that is likely to be introduced by microscale sampling 
variability, such that if the core was taken a few centi¬ 
meters away from the initial sampling point, a different 
inventory value might be obtained [148]. Problems of 
sampling variability assume particular importance 
when samples are collected to define the local reference 
inventory [149, 150]. In addition to carefully selecting 
a stable undisturbed site that should provide a reliable 
estimate of the reference inventory, there is a need to 
collect an appreciable number of cores from the site, 
the results for which can be averaged, in order to obtain 
a reliable estimate of that inventory. The inventories 
measured for individual cores collected from a refer¬ 
ence site can frequently be characterized by a coefficient 
of variation of ca. 20%, or perhaps more, and this 
means that about 16 cores would need to be collected 
to provide a mean value that was within 10% of the true 
mean at the 95% level of confidence. The use of 
a scraper plate [151], which collects the sample from 
a larger area, as an alternative to cores, can help provide 
more representative samples. Similar considerations 
apply when collecting samples to be used to represent 
the fingerprint properties of potential sources in 
source fingerprinting investigations. It is important to 
ensure that the samples collected are representative of 
the sediment that might be mobilized from a given 
source and the uncertainty introduced by representing 
the final fingerprint by mean property values must be 
recognized. Problems of spatial variability are generally 
less important when selecting the locations for collec¬ 
tion of cores to be used for determination of the chro¬ 
nology of sediment deposits, but again it is important 
to collect the core from a representative position and it 
is generally considerably more difficult to appreciate 
potential subsurface variability or heterogeneity than 
variability related to readily visible surface conditions. 

Corers will be a key requirement in many investi¬ 
gations employing fallout radionuclides to document 
soil and sediment redistribution and sedimentation. 
A range of different corers have been developed for 
use in different materials and environments, including 
lakes and reservoirs, wetlands, and soils and these are 
described in the literature. Sample size may be an 
important constraint, particularly with sectioned 
cores, and large diameter corers may be required to 
provide sufficient mass for the analysis of radionuclide 
activity, if the core is sectioned into small depth 


increments (e.g., 1 cm). For soils, cylindrical core 
tubes are commonly used, but if soils are compacted 
and dry, it may prove difficult to propel the core tube 
into the soil to the required depth (e.g., 60 cm). In this 
situation, a motorized percussion driver is often used 
to insert the core tube and a winch device may be 
required to extract the core tube. A scraper plate 
[151] can provide an alternative to a core, particularly 
if detailed depth incremental sampling is required, but 
use of these devices can prove very time consuming 
relative to the use of sectioned cores. One particular 
problem commonly faced when using 7 Be measure¬ 
ments is the need to section a shallow core into very 
fine depth increments (e.g., 2 mm), in order to define 
the precise shape of the depth distribution, which may 
only extend down to ca. 15 mm. In order to obtain 
samples of sufficient mass for the individual depth 
increments, a corer with a sizeable diameter will be 
required and devices involving a piston moved by 
a screw thread have been developed to permit the 
extrusion of the core to be carefully controlled. 

Laboratory Procedures and Techniques 

A key feature of the use of fallout radionuclides is that 
for most applications, the activities of the radionu¬ 
clides of interest can be readily measured by gamma 
spectrometry and, provided suitable detectors are 
available, the radionuclides can be measured contem¬ 
poraneously during the same counting operation. This 
removes the need for separate analyses. Gamma spec¬ 
trometry is therefore the main analytical technique 
employed in fallout radionuclide investigations, 
although, in some studies focusing on 210 Pb, alternative 
analytical procedures using radiochemical separations 
and alpha counting are used [152]. However, this 
approach involves chemical extractions which require 
a substantial time input; furthermore, it can only pro¬ 
vide measurements of total 210 Pb. Additional gamma 
spectrometry measurements would still be needed to 
quantify the 226 Ra and supported 210 Pb activity and 
thus calculate the 210 Pb ex activity. In the case of sedi¬ 
ment cores, collection of samples from depths below 
the likely influence of 210 Pb ex fallout, where the mea¬ 
sured 210 Pb activity can be assumed to represent 
supported 210 Pb, can provide a means of establishing 
the likely supported 210 Pb activity of the sediment, 
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although this value could change upcore. Another 
important advantage of gamma spectrometry is that 
only minimal sample preparation is required. Samples 
need only to be dried and disaggregated and possibly 
sieved prior to placing in containers to be counted. 
However, one key limitation of the assay of environ¬ 
mental samples by gamma spectrometry is the long 
count times involved. These are commonly of the 
order of 6-12 h per sample, and where activities are 
lower, count times of 24 h or more may be necessary. 
The long count times will frequently impose important 
limitations on the number of samples that can be 
analyzed within an investigation, since gamma detec¬ 
tors are relatively expensive and it is likely to be possible 
to operate only a limited number of detectors in 
a laboratory. With 137 Cs and 210 Pb ex , dealing with 
large backlogs of samples may not represent too great 
a problem, apart from the time required to complete 
a study. However, with 7 Be, the short half-life means 
that samples must be analyzed relatively quickly after 
collection and this will commonly impose a limit on 
the number of samples that can be collected and ana¬ 
lyzed within a particular study. It is important that such 
studies should be carefully planned to take account of 
these constraints to ensure that the final data set is 
complete. 


Further details of the instrumentation required for 
analysis of samples by gamma spectrometry and of the 
procedures employed when analyzing soil and sedi¬ 
ment samples are available in specialist handbooks 
[ 153] . Measurement of 137 Cs, 7 Be, and 210 Pb ex activities 
is undertaken using Germanium detectors, which have 
a high resolution and are therefore able to distinguish 
the signals produced by different radionuclides at sim¬ 
ilar energy levels. Figure 13 provides a useful summary 
of the main features of the various Germanium gamma 
detectors that are available, and attention should be 
directed to both the geometry and the energy ranges 
of these detectors. Although 137 Cs and 7 Be can be 
measured on standard gamma detectors, the measure¬ 
ment of 210 Pb requires a low-energy detector that is 
capable of undertaking measurements at the low- 
energy level (46 keV) associated with this radionuclide. 
Fortunately, the low-energy detectors needed for 210 Pb 
can, if required, also be used to measure 137 Cs and 7 Be, 
as well as to make the necessary measurements to 
determine the 226 Ra content of the sample required to 
calculate the supported 210 Pb activity. Where large 
samples (e.g., 1 kg) are involved, Marinelli beakers are 
commonly used as the sample container, as these fit 
over the detector head and are designed to optimize the 
positioning of the sample relative to the detector for 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 13 

The configuration, geometry, and response characteristics of Germanium detectors used for gamma spectrometry [154] 
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counting. Where small samples (e.g., 50-100 g) are 
counted, these can be put into small plastic pots that 
are placed on the detector head. For smaller samples 
(e.g., ca. 10 g), such as may be obtained from the 
sectioning of small diameter sectioned cores or from 
small samples of suspended sediment collected from 
a river, a well detector is commonly used. In this case, 
the sample is placed in a tube which is inserted into 
a hole in the top of the detector in order to optimize the 
counting geometry. 

Although gamma spectrometry is relatively 
undemanding in terms of sample preparation, the 
detectors must be carefully calibrated to provide 
accurate measurements [153]. In the case of 210 Pb 
measurements, self-absorption effects can introduce 
problems and must be considered. The principle of 
measurement, which depends on the interaction of 
gamma rays (photons) produced by the disintegration 
of the radionuclide, an essentially random process, 
with the germanium crystal, means that the precision 
of the results obtained is heavily dependent on the 
number of disintegrations counted. This in turn 
depends on the activity of the sample, the efficiency 
of the detector, and the counting time. All results 
should therefore be accompanied by an indication of 
their likely precision, and this uncertainty must be 
recognized when using the results. The precision is 
typically of the order of ±5-10% at the 95% level of 
confidence. In the case of 137 Cs and 7 Be, the radionu¬ 
clides can be measured directly by gamma spectrome¬ 
try, but determination of 210 Pb ex requires two 
measurements. The first quantifies the total 210 Pb con¬ 
tent of the sample and the second establishes the 226 Ra 
content, which is used to calculate the supported 210 Pb 
activity by assuming equilibrium between 226 Ra and 
210 Pb. The 210 Pb ex activity is then calculated by 
subtracting the supported 210 Pb activity from the 
total 210 Pb activity. The Radium-226 content of 
a sample is usually determined by measuring the activ¬ 
ity of its daughters 214 Bi and 214 Pb, because its direct 
measurement is complicated by interference from other 
radionuclides. The need for two measurements and the 
subsequent subtraction means that the precision of the 
final estimates of 210 Pb ex activity is usually lower than 
that associated with 137 Cs and 7 Be measurements. 

Although gamma spectrometry measurements are 
generally undertaken in the laboratory, to which 


samples collected in the field are transported, attention 
has been directed to the potential for using in situ 
detectors [155, 156]. In this case, the detector is 
deployed in the field in a portable configuration. The 
detector head is positioned ca. 1.5 m above the soil 
surface, and the gamma rays received from the soil are 
counted. Because gamma rays will be received from 
a relatively large area, the approach overcomes some 
of the sampling variability problems associated with 
the collection of small core samples and this, as well 
as the possibility of obtaining immediate results, are 
seen as important potential advantages. However, 
problems can arise in calibrating the measurements, 
since the precise area of soil from which gamma rays 
are received can prove difficult to define, due to atten¬ 
uation effects. Equally a given radionuclide inventory 
can produce different signals according to the depth 
distribution of the radionuclide in the soil, and there is 
a need to make certain assumptions regarding the likely 
form of the depth distribution. Self-absorption effects 
also mean that the approach is unlikely to be applicable 
to 210 Pb measurements. Furthermore, in most environ¬ 
ments, substantial count times (e.g., ca. 1 h) will be 
required and this may limit the number of points for 
which measurements can be collected. As a result, the 
use of field portable detectors has not been widely 
adopted and further research is required to assess 
more fully the potential and limitations of the 
approach. However, the approach has proved more 
successful for 137 Cs measurements in areas that 
received substantial amounts of Chernobyl fallout, 
since the high activities in such areas greatly reduced 
the count times required [157]. Furthermore, with 
such high activities, the 137 Cs signal is more readily 
distinguished from those of other radionuclides with 
similar energy levels and low cost scintillation detectors 
can be used in place of more costly Germanium 
detectors. 

Uncertainty Considerations in the Use of Fallout 
Radionuclide Measurements 

The brief overview of field and laboratory techniques 
provided above has emphasized the potential uncer¬ 
tainty introduced into the application of fallout radio¬ 
nuclide measurements by sampling variability and 
measurement precision. Any measurement of the 
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fallout radionuclide activity or inventory is likely to 
involve a significant degree of uncertainty by virtue of 
both the heterogeneity of natural soils and sediments 
and the associated sampling variability and the limited 
precision of gamma spectrometry measurements. It is 
important that this uncertainty should be recognized, 
at least implicitly, if not explicitly, and if possible prop¬ 
agated through subsequent use of the data, for example 
to derive estimates of soil redistribution rates by com¬ 
paring measurements of the fallout radionuclide inven¬ 
tory made at individual sampling points with the local 
reference inventory. 

Some Examples of the Application of Fallout 
Radionuclides in Erosion and Sedimentation 
Investigations 

Further consideration of the application of fallout 
radionuclides in the study of erosion and sedimenta¬ 
tion within the environment can usefully involve a brief 
overview of several case studies, based on investigations 
undertaken by the author and his coworkers, which 
demonstrate the potential of the approach across 
a range of applications. These include soil erosion, 
floodplain sedimentation, and sediment source 
tracing. 

Soil Erosion: A Field Scale Investigation 

Although most work involving the use of fallout radio¬ 
nuclides in soil erosion investigations has been based 
on measurements of 137 Cs, both 210 Pb ex and 7 Be have 
also been used in similar applications. By collecting 
cores from a study site, measuring the 137 Cs, 210 Pb ex , 
or 7 Be inventories, and applying a conversion model, it 
is possible to derive point estimates of the erosion and 
deposition rates associated with the cores. The time 
base of the erosion rate estimates will depend on the 
radionuclide used. These values can then be used to 
establish the spatial pattern of soil redistribution, and 
by integrating the values across the study site, it is 
possible to calculate the gross and net erosion and the 
sediment delivery ratio. The latter represents the pro¬ 
portion of the eroded soil that is transported out of the 
study site and provides important information on the 
likelihood of eroded soil reaching watercourses. By 
making measurements of more than one radionuclide, 
it is often possible to obtain additional information, by 


virtue of the different half-lives or fallout histories of 
the radionuclides. Figure 14 presents the results of an 
investigation of rates and patterns of soil redistribution 
within a 6.7-ha field at Higher Walton Farm near 
Crediton in Devon, UK [33]. The area covered by the 
field represents a valley head depression with slopes of 
up to 15° converging toward its outlet. In this case, 
measurements of both 137 Cs and 7 Be activities were 
undertaken, with the former providing estimates of 
average rates of soil redistribution over the past ~45 
years, and the latter estimates of the erosion rates 
associated with a particular period of heavy rainfall 
(69 mm in 7 days) that occurred in early January 1998. 

In this study, the soil cores used for the 137 Cs and 
7 Be measurements were collected during two separate 
campaigns, although they could have been collected 
together. In both cases, the cores were collected at the 
intersections of a 20 m x 20 m grid, resulting in a suite 
of ~140 cores from each sampling campaign. Cores 
were also collected from adjacent areas of undisturbed 
land, which were not expected to have experienced 
either erosion or deposition, in order to establish the 
reference inventory. For the 137 Cs measurements, the 
cores were collected in August 1996, using a motorized 
percussion corer equipped with a 6.9-cm internal 
diameter steel core tube, which was inserted into the 
soil to a depth of ~60 cm. The cores used for the 7 Be 
measurements were, in contrast, much shallower and 
were collected manually to depths of 3-5 cm using 
a 15-cm diameter plastic core tube, in January 1998. 
During the preceding spring/summer of 1997, the field 
had been cultivated and sown to maize and the crop 
was harvested in early November 1997, when the soil 
was compacted by the harvesting equipment. After 
harvesting, the field was left bare and uncultivated 
over the winter and the period of heavy rainfall in 
early January 1998 resulted in substantial surface run¬ 
off and soil erosion. The pattern of 137 Cs inventories 
documented for the study field is presented in Fig. 14a. 
The cores collected from the adjacent undisturbed sites 
provided an estimate of the local 137 Cs reference inven¬ 
tory at the time of sampling of ~2,500 Bq m -2 and the 
pattern of 137 Cs inventories shows clear evidence of 
both erosion (reduced inventories) and deposition 
(increased inventories). Use of a mass balance conver¬ 
sion model incorporating the effects of soil redistribu¬ 
tion by tillage enabled estimates of mean annual soil 
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137 Cs inventory 
(Bq nr 2 ) 

2,600 
2,333 
2,067 
1,800 


7 Be inventory 
(Bq nr 2 ) 
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200 
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soil redistribution rate 
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soil redistribution rate 
(kg rrr 2 year 1 ) 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 14 

The spatial distribution of 137 Cs and 7 Be inventories within a field at Higher Walton Farm, near Crediton, UK (a) and of the 
estimates of soil redistribution rates derived from these inventories (b) 


redistribution rates to be derived from the measured 
inventories. These soil redistribution rates relate to the 
period of ca. ~45 years prior to the sampling campaign 
in 1996. The pattern shown has been mapped in 


Fig. 14b and the data have been summarized in Table 3, 
which presents values for the range of the documented 
soil redistribution rates, the mean erosion rate for the 
eroding areas, the mean deposition rate for the 
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depositional areas, the net soil loss from the field, and 
the sediment delivery ratio. 

The spatial distribution of 7 Be inventories within 
the study field measured at the end of the period of 
heavy rainfall in early January 1998 is also presented in 
Fig. 14a. The equivalent value for the local reference 
inventory was estimated to be 533 Bq m -2 and the 
pattern depicted provides clear evidence of areas with 
both reduced and increased inventories, which in turn 
reflect both erosion and deposition within the field. In 
order to interpret this pattern in terms of soil redistri¬ 
bution rates associated with the period of heavy rainfall 
in early January, it was important to consider the extent 
to which it might reflect spatial variability inherited 
from previous erosion events. However, the preceding 
autumn and early winter had been relatively dry and 
there was no evidence of surface erosion having 
occurred during the previous 6 months. In this situa¬ 
tion, the 7 Be inventory within the field immediately 
prior to the period of heavy rainfall in early January 
1998 could be expected to be spatially uniform, since it 
would reflect the wet fallout associated with rainfall 
over the preceding months. Any spatial variability pro¬ 
duced by soil redistribution associated with erosion 
events much earlier in 1997 would have been rendered 
insignificant by radioactive decay, due to the short half- 
life of 7 Be. It was therefore possible to assume that the 
spatial variability in 7 Be inventories within the study 
field evident in Fig. 14a reflects soil redistribution asso¬ 
ciated with the period of heavy rainfall in early January 
1998. By using the conversion model for 7 Be measure¬ 
ments described previously in this chapter (i.e., Eq. 7), 
which takes account of the increase or decrease in 
inventory relative to the reference inventory and the 
depth distribution of 7 Be in uneroded soil within the 
field, it was possible to estimate the soil redistribution 
rates associated with the period of heavy rainfall pre¬ 
ceding the sampling campaign. The pattern of soil 
redistribution rates demonstrated by these results is 
presented in Fig. 14b and summary data, equivalent 
to that provided for the 137 Cs measurements, are listed 
in Table 3. The soil redistribution rates (kg m -2 ) asso¬ 
ciated with the period of heavy rainfall in early January 
1998 estimated from the 7 Be measurements are sub¬ 
stantially higher than the equivalent longer-term mean 
annual soil redistribution rates estimated using the 
137 Cs measurements. However, the sediment delivery 


Fallout Radionuclides and the Study of Erosion and 
Sedimentation. Table 3 A comparison of rates of soil 
redistribution within a field at Higher Walton Farm, near 
Crediton, UK, estimated using the 137 Cs and 7 Be measure¬ 
ments made on soil cores collected from the field 


Measure 

137 Cs (kg m 2 
year ’) 

7 Be (kg m 2 ) 

Range 

-4.5 to 2.0 

-11.9 to 9.8 

Mean erosion rate for 

-1.1 

-5.3 

eroding area 



Mean deposition rate for 

0.69 

4.0 

depositional area 



Net soil loss 

-0.48 

-2.5 

Sediment delivery ratio 

0.83 

0.80 


Based on Walling et al. [33] 


ratios are closely similar, indicating that ^80% of the 
eroded sediment was transported out of the field. 
The high sediment redistribution rates associated 
with the period of heavy rainfall in early January 1998 
reflect both the extreme nature of this period of rainfall 
and, perhaps more importantly, the condition of the 
field, which having been compacted by the maize 
harvesting machinery and left bare after the harvest 
was particularly susceptible to surface runoff and ero¬ 
sion. Such results underscore the potential significance 
of a small number of extreme events and the incidence 
of particular land use conditions in controlling erosion 
from the study field. 

Soil Erosion: A Countrywide Survey 

A key feature of most existing studies involving the use 
of measurements of 137 Cs and other fallout radionu¬ 
clides in soil erosion investigations has been the 
emphasis on small areas, frequently individual fields. 
It has been argued that if fallout radionuclides are to be 
more widely used as an operational tool to support 
policy, as distinct from a research tool, there is a need 
to upscale their application. Analytical constraints, 
including the relative long count times and the sub¬ 
stantial cost of gamma spectrometers, which inevitably 
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limit the number of detectors available in most labora¬ 
tories, means that it is not feasible to upscale by simply 
collecting and analyzing a greatly increased number of 
samples. There is a need to devise novel sampling 
strategies that permit the use of 137 Cs measurements 
at the reconnaissance scale. Work by Loughran and 
coworkers [92] in Australia attempted to develop 
a national soil erosion survey based on 137 Cs measure¬ 
ments, and, in this case, 206 sites located in many 
different parts of the country were sampled using single 
downslope transects comprising about 20 sampling 
points. Recent work in the UK [158] has attempted to 
develop this approach further by reducing the number 
of samples collected from a site, while still obtaining 
a representative estimate of the gross and net erosion 
rate and thereby increasing the number of sites for 
which data can be assembled. By selecting fields of 
similar size and simple topography, but representative 
of a wide range of conditions known to influence soil 
erosion rates, including soil texture, slope steepness, 
effective rainfall and land use, it was possible to com¬ 
mence the assembly of a national scale database on soil 
erosion rates on agricultural land. 

In the initial stages of this study, detailed investiga¬ 
tions were undertaken in a number of fields to confirm 
the potential for using a small number of samples (e.g., 
[10-15]) collected along a carefully positioned down- 
slope transect to provide a meaningful assessment of 
the gross and net erosion rates in a given field. 


The information on 137 Cs inventories provided by the 
transects was used to estimate soil redistribution rates, 
using a conventional mass balance model and the dif¬ 
fusion and migration model for cultivated and non- 
cultivated fields, respectively. Estimates of the gross and 
net erosion rates for individual fields were derived by 
integrating the resulting estimates of soil redistribution 
rate along the length of the transect. Comparison of the 
results provided by the single transect with those pro¬ 
vided by much denser grid or transect sampling net¬ 
works, using the same conversion models, confirmed 
the validity of the approach and indicated that 
a transect comprising ca. 10-15 sampling points was 
capable of providing a reliable estimate of the gross and 
net erosion rates within individual fields. 

A national sampling campaign was subsequently 
undertaken to obtain estimates of gross and net erosion 
rates for 248 fields located in different areas of England 
and Wales. The data collected are summarized in 
Fig. 15, which distinguishes the erosion rate estimates 
obtained for arable and pasture fields. These data have 
provided a basis for assessing the range of erosion rates 
occurring on agricultural land in England and Wales. 
The gross and net erosion rates documented for arable 
fields ranged between 1.2 and 29.31 ha -1 year -1 and 0.0 
and 27.3 t ha -1 year -1 , with median rates of 6.6 and 
5.2 t ha -1 year -1 , respectively. The equivalent ranges 
for pasture were 0.7-13.0 and 0.0-11.7 t ha -1 year -1 , 
with median values of gross and net erosion of 2.6 and 


Arable fields Pasture fields 



a Erosion rate (t ha 1 year : ) b Erosion rate (t ha 1 year 1 ) 

Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 15 

Frequency distributions of mean annual gross and net erosion rates for the arable and pasture fields in England and Wales 
sampled in the national survey undertaken by Walling and Zhang (2010) [158] 
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1.8 t ha -1 year -1 , respectively. The ratio of net to gross 
erosion, which provides a useful measure of potential 
sediment delivery to watercourses, was typically >0.6 
for arable fields and <0.6 for pasture fields, emphasiz¬ 
ing the lower efficiency of sediment delivery from pas¬ 
ture areas. Comparison of the erosion rates obtained 
from the study with existing erosion hazard assess¬ 
ments, based on soil texture, confirmed the broad 
consistency of the results. The assembled dataset also 
provided a basis for exploring the influence of a range 
of potential controlling variables on soil erosion rates, 
using statistical tests. No very strong relationships 
existed, but several statistically significant relationships 
between the erosion rate variables and slope gradient 
and soil texture were found. The data have also been 
used to develop a typology for extrapolating the assem¬ 
bled information on gross and net erosion rates, based 
on land use, soil texture, and slope steepness and this 
has been employed within a GIS to produce national 
scale maps of gross and net rates of soil loss from 
agricultural land in England and Wales and a prelimi¬ 
nary national scale inventory of soil loss for England 
and Wales. 

Documenting Floodplain Sedimentation Rates 

Overbank deposition on river floodplains during flood 
events represents an important potential sink for 
suspended sediment transported through a river sys¬ 
tem, and recent studies have demonstrated that such 
conveyance losses can be as high as ~40% of the 
suspended sediment load delivered to the main channel 
system [159]. Where attempts are made to reduce 
downstream sediment loads through improved land 
management and other sediment control measures, it 
can be important to document rates of overbank sed¬ 
imentation on river floodplains, in order to quantify 
the potential magnitude of this sediment sink. In addi¬ 
tion to representing a conveyance loss, the sediment 
deposited on the floodplain can also give rise to environ¬ 
mental problems, if it contains high levels of contami¬ 
nants, such as heavy metals, or nutrients, particularly 
phosphorus. Again, information on rates of overbank 
sedimentation can represent an important requirement 
to assess such hazards. Since overbank inundation is 
commonly infrequent and difficult to forecast, direct 
measurements of associated sedimentation rates, using 


sediment traps and related techniques, can prove difficult 
to implement. Furthermore, there will often be a need to 
establish longer-term rates of sedimentation to take 
account of the frequency of floods of different magni¬ 
tude. In this context, the use of 137 Cs and 210 Pb ex mea¬ 
surements to document longer-term mean annual 
sedimentation rates offers considerable potential. As 
an example, Fig. 16 shows how 137 Cs and 210 Pb ex mea¬ 
surements were used to document overbank sedimen¬ 
tation rates along a short reach of the floodplain of the 
River Severn near Buildwas in Shropshire, UK. In this 
study, 124 sediment cores were collected at the inter¬ 
sections of a 25 m x 25 m grid using a motorized 
percussion corer equipped with a 6.9-cm internal 
diameter core tube. Cores were collected to a depth of 
~70 cm to ensure that they included the complete 
137 Cs and unsupported 210 Pb profiles. Measurements 
of the 137 Cs and 210 Pb ex inventories of the cores were 
used to estimate the mean annual sedimentation rates 
at the coring points using the procedures documented 
by [118, 122] and described previously. In the case of 
137 Cs, the estimate of the sedimentation rate derived 
from a sectioned core and the depth of the 137 Cs peak 
(see Eq. 8) was extrapolated to the other sampling 
points where bulk cores were collected, using the pro¬ 
cedure represented in Eq. 10. In the case of the 210 Pbex 
measurements, the values of total inventory obtained 
for the individual bulk cores collected from the sam¬ 
pling points were used with the CICCS model (Eq. 15) 
to estimate the sedimentation rates at those points. 
These estimates of sedimentation rate were used to 
map the patterns of sedimentation within the reach 
presented in Fig. 16. These patterns reflect the interac¬ 
tion of a number of controls on the sedimentation rate, 
including depth of inundation, distance from the chan¬ 
nel, and the local microtopography. It is also possible to 
compare the sedimentation rate estimates derived from 
the 137 Cs measurements, which relate to the past ~40 
years, with those based on the unsupported 210 Pb mea¬ 
surements, which relate to the past ~100 years, in order 
to assess longer-term changes in sedimentation at this 
location. The mean annual sedimentation rate at this 
site over the past 40 years is 0.28 g cm -2 year -1 , 
whereas the equivalent rate for the past 100 years is 
0.33 g cm -2 year -1 . This suggests that rates of overbank 
sedimentation have decreased slightly toward the 
present. This decrease could reflect changes in the 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 16 

The spatial distribution of overbank sedimentation rates within a small reach of the floodplain of the River Severn near 
Buildwas, Shropshire, UK derived from 137 Cs and 210 Pb ex measurements undertaken on floodplain cores 


magnitude and frequency of flood inundation, but it 
might also reflect a progressive reduction in inundation 
depths as the level of the floodplain surface gradually 
increased in height as a result of overbank deposition. 

Using Floodplain Deposits to Reconstruct Changes 
in the Properties of Sediment Transported by a River 

Where post-depositional changes in sediment geo¬ 
chemistry can be assumed to be of limited importance, 


it is possible to use downcore changes in the geochem¬ 
istry of overbank deposits to reconstruct changes in the 
properties of the suspended sediment transported by 
a river. The overbank deposits effectively preserve 
a record of the suspended sediment transported by 
the river in the recent past [160] and fallout radionu¬ 
clides can be used to provide information on the chro¬ 
nology of such deposits. A useful example of this 
approach is provided by a study undertaken by the 
author and his coworkers on the floodplain bordering 
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the main channel of the River Axe, which drains a 
303 km 2 catchment in Devon, UK, aimed at 
reconstructing changes in the phosphorus content of 
the deposited sediment. Floodplain cores were col¬ 
lected from three representative sites in the upper, 
middle, and lower reaches of the river basin (see 
Fig. 17a) and measurements of both the 137 Cs activity 
and the total-P content of the sectioned cores were 
undertaken, in order to establish the associated depth 
profiles. The total-P profiles shown in Fig. 17 indicate 
that the total-P content of sediment deposited on the 
river floodplains at the three sites has more than dou¬ 
bled in recent decades, in response to increased fertil¬ 
izer application and livestock densities in the upstream 
catchments and increased effluent discharges from 
sewage works and related point sources. The 137 Cs 
measurements were used to establish an approximate 
chronology for these changes by defining the level of 
the floodplain surface in 1963, the year of peak 137 Cs 
fallout (Fig. 17b), and assuming a constant annual 
sedimentation rate. This chronology was used to derive 
the plots of the changing total-P content of deposited 
sediment since 1950 presented in Fig. 17c. At all three 
sites, the total-P content of deposited sediment has 
increased by more than 100% over the past 50 years, 
but higher concentrations are found in the middle and 
lower reaches of the catchment where agriculture is 
more intensive and effluent discharges are greater. 
Although not clearly defined, there is some evidence 
of a reduced total-P content in deposited sediment in 
the last 10 years and this could reflect a response to 
improved land management and treatment of waste 
water discharges. Information of this nature can afford 
a valuable basis for defining baseline levels for fluvial 
sediment properties and assessing the impact of chang¬ 
ing land management and wastewater treatment in 
reducing the inputs of P and other sediment-associated 
nutrients and contaminants to river systems. 

Use of Fallout Radionuclides for Sediment Source 
Fingerprinting: An Example from the UK 

The siltation of spawning gravels has been increasingly 
identified as a key factor contributing to the declining 
success of salmon fisheries in British rivers. Concern for 
this problem has focused attention on the need to 
reduce gravel siltation and thus to reduce fine sediment 


mobilization and transport in impacted catchments, 
through the establishment of sediment control pro¬ 
grams. The development of effective sediment control 
programs requires information on the likely sources of 
the fine sediment accumulating in spawning gravels, 
since these sources must be targeted if the control 
program is to prove effective. In an attempt to provide 
such information, a reconnaissance source fingerprint¬ 
ing survey of several representative rivers, located in 
different parts of Britain, was undertaken by the author 
and his coworkers in collaboration with the Environ¬ 
ment Agency [161]. 

Samples of interstitial fine sediment were recovered 
from salmonid spawning gravels within a representa¬ 
tive selection of rivers in England and Wales (Fig. 18), 
during a national fieldwork program conducted by the 
Environment Agency over the period 1999-2000. Sam¬ 
ple collection was based on the use of retrievable basket 
samplers, which were installed in artificial redds 
constructed in spawning gravels at representative loca¬ 
tions. Between one and five basket samplers were 
installed in each of the rivers identified on Fig. 18. 
The basket samplers were filled with clean framework 
gravel (>6.4 mm) prior to their installation, and they 
were retrieved ca. 3 months later. The gravel contained 
within the basket was subsequently wet sieved to 
recover the fine (<0.125 mm) interstitial sediment 
that had accumulated within the gravel during the 
period of deployment and this fraction was used for 
sediment source fingerprinting. By virtue of the recon¬ 
naissance nature of the study, which included 18 catch¬ 
ments, sampling of potential source materials focused 
on the broad distinction between surface and channel 
bank/subsurface sources, and a total of 672 source 
material samples were collected from the different 
study areas. These samples were sieved to <0.125 mm 
to facilitate direct comparison with the samples of 
fine interstitial sediment. The limited resources avail¬ 
able to the study also precluded the use of an extensive 
range of fingerprint properties to discriminate the 
two potential sources and emphasis was placed on the 
use of radiometric ( 137 Cs, 210 Pb ex , 226 Ra) measure¬ 
ments, coupled with information on the organic prop¬ 
erties (C and N content) of the potential sources. Mean 
values of these properties were used to characterize 
the two potential sources in each of the river basins 
investigated. 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 17 

The changing total-P content of fine sediment deposited on the floodplain of the River Axe over the past 50 years, showing 
(a) the location of the coring points, (b) the total-P depth distributions, and (c) the reconstructed trends in the total-P 
content of fine sediment deposited on the floodplain over the past 50 years 


A multicomponent mixing model, incorporating 
correction for the effects of contrasts in particle size 
and organic matter content between the samples of 
interstitial sediment and the source material, was used 
to estimate the relative contributions of surface and 
channel bank/subsurface sources to the fine sediment 
recovered from the spawning gravels. The results of 
these computations for different regions of England 


and Wales are presented in Fig. 18. Significant contrasts 
in the relative contributions of the two sources are 
apparent between the regions, with, for example, sur¬ 
face sources accounting for >90% of the fine interstitial 
sediment in north Wales and southern England, but 
only 16% and 39% in south-west England and south 
and mid Wales, respectively. These regional contrasts 
reflect the interaction of land use and both erosion 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 18 

Fingerprinting the source of fine sediment accumulating in salmon spawning gravels, showing the location of the 
sampled rivers and regional contrasts in the source of the fine sediment (Based on [161]) 


processes and the efficiency of sediment transfer to the 
channel network. In south-west England, where stock¬ 
ing densities are frequently high, drainage densities are 
relatively high and river channels are generally incised, 


a combination of livestock trampling of channel banks 
and erosion of unstable well-developed vertical channel 
banks mean that channel and subsurface sources are 
the dominant source of fine interstitial sediment. 
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In contrast, the greater importance of arable farming 
and more specifically soil erosion on large cultivated 
fields with few boundaries to interrupt slope-channel 
connectivity, combined with the relative stability of 
the well-vegetated channel banks, result in surface 
soils providing the dominant source (94%) of the sam¬ 
ples of fine interstitial sediment recovered from 
spawning gravels in southern England. Surface sources 
are also important in the upland areas of northern 
England and north Wales, where high rainfall and 
grazing pressure promote the erosion of surface soils 
under upland pasture or moorland and the steep 
topography combined with an absence of field bound¬ 
aries results in the efficient routing of sediment to the 
river channels. 

The contrasts in the relative importance of surface 
and channel bank/subsurface sources in different 
regions of England and Wales outlined above have 
important implications for the design and implemen¬ 
tation of effective sediment control strategies. Where 
bank erosion is the dominant source, attention should 
clearly focus on reduction of livestock trampling of 
channel margins and improvement of bank stability 
(e.g., by fencing and revegetation of channel margins). 
However, such measures are likely to be of little value in 
areas where surface sources are dominant and where 
emphasis should be placed on controlling sediment 
mobilization and transfer within the catchment more 
generally. 

Use of Fallout Radionuclides for Sediment Source 
Fingerprinting: An Example from China 

Another example of the potential for using environ¬ 
mental radionuclides, and more particularly 137 Cs, for 
fingerprinting sediment sources is drawn from work 
in the rolling loess plateau region of Shaanxi Province, 
China [ 162] . The landscape in the study area comprises 
two contrasting geomorphological units, representing 
the rolling plateau surface and the gully systems which 
dissect the plateau surface The very high sediment 
yields in this environment (~ 10,000 t km -2 year -1 ) 
cause major downstream problems in terms of reser¬ 
voir and channel siltation, and this has focused atten¬ 
tion on the need to reduce sediment yields through soil 
and water conservation and sediment control mea¬ 
sures. In planning such measures, there is a need to 


establish the relative importance of soil erosion 
from the cultivated areas of the rolling plateau and 
erosion occurring within the gully systems, in contrib¬ 
uting to downstream sediment yields, since these 
two potential sediment sources require very different 
approaches to reduction in sediment mobilization 
and transfer. 

A study aimed at establishing the relative impor¬ 
tance of the two main sediment sources was under¬ 
taken in the 3.86 km 2 Zhaojia gully catchment, where 
the plateau surface and gully systems were of similar 
spatial extent and accounted for 53% and 47% of the 
catchment area respectively (Fig. 19). A check dam had 
been constructed across the channel in the middle 
portion of the catchment in early 1991 (Fig. 19), and 
when the catchment was visited in October 1993, the 
deposits associated with two floods occurring during 
the period July-August 1993 were clearly distinguish¬ 
able in sediment cores collected from the area of sedi¬ 
ment deposition above the check dam. The total depth 
of sediment deposition associated with the two events 
was ~33 cm. Based on an analysis of three cores, the 
mean 137 Cs content of sediment deposited by the first 
flood was 0.74 Bq kg -1 , whereas that for the second 
flood was 1.06 Bq kg -1 . In attempting to decipher the 
main sources of the deposited sediment and thus the 
sediment yield from the catchment, attention was 
directed to variations in the 137 Cs concentration of 
potential sources within the two main sediment sources 
areas. Mean 137 Cs concentrations associated with these 
sources exhibited significant variations and offered an 
effective means of discriminating the sources. 
A summary of the 137 Cs content of potential source 
materials in the Zhaojia Gully catchment is provided in 
Table 4. 

When the average 137 Cs content of the sediment 
deposits associated with the two floods occurring in 
1993 was compared with that of potential source mate¬ 
rial collected shortly after the flood (cf. Table 4), it was 
found to be similar to that of cultivated soil from 
the steep gully slopes, lower than that of soil from the 
cultivated land on the plateau crests and steeper lower 
slopes of the rolling plateau, and considerably lower 
than that of topsoil from the grass covered areas. It was 
also very substantially higher than that of surface mate¬ 
rial from steep bare slopes and from recent mass move¬ 
ment deposits within the gully area. In view of either 
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Fallout Radionuclides and the Study of Erosion and Sedimentation. Figure 19 

The Zhaojia Gully catchment showing its location (a) and its main features (b). 1, catchment divide; 2, the boundary 
between the rolling plateau and the gully areas; 3, the main channel; 4, the check dam; 5, area of sediment deposition 
associated with the check dam; 6, divide of the catchment contributing to the sediment trap reservoir; 7, sampling sites for 
sediment deposits; 8, village (Based on [162]) 


Fallout Radionuclides and the Study of Erosion and Sedimentation. Table 4 The 137 Cs content of potential source 
materials in the Zhaojia Gully catchment from different topographic locations and under different land use conditions 


Zone 

Source material and land type 

Catchment 
area (in %) 

No. of 
samples 

Mean 137 Cs content 
(Bq kg' 1 ) 

Erosional behavior 

Gully 

(47%) 

Surface soil on grassland of gully 
slopes 

21 

8 

7.0 

Slight sheet and rill 
erosion 


Plowed soil on steep gully slopes 

2 

7 

1.2 

Very severe sheet and 
rill erosion 


Surface soil on bare slopes and 
mass movement deposits 

19 

7 

0.02 

Very severe sheet and 
rill erosion on bare 
slopes and active mass 
movements 


Village area, cultivated land on 
terrace and gully floor and tracks 

5 

- 

- 


Rolling 

plateau 

Cultivated soil of crest slopes 

14 

36 

6.4 

Moderately severe 
sheet and rill erosion 

(53%) 

Cultivated soil of lower slopes 

36 

32 

3.4 

Severe sheet and rill 
erosion 


Surface soil on grassland 

3 

- 

- 



Based on Walling et al. [162] 
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their limited extent or their low erosion rates, the 
cultivated land on the steep gully slopes and the grass¬ 
land within the gully area were deemed unlikely to 
represent important sediment sources and the culti¬ 
vated areas of the rolling plateau and the steep bare 
slopes and recent mass movement deposits within the 
gully area were identified as representing the primary 
sediment sources. Taking account of the relative extents 
of the cultivated areas on the crests and the steeper 
upper slopes of the rolling plateau and estimates 
of erosion rates obtained from these areas using 137 Cs 
measurements, a weighted average 137 Cs content for 
sediment mobilized from these cultivated areas of the 
rolling plateau of 3.9 Bq kg -1 was estimated. 

Assuming that the average 137 Cs content of surface 
material from the steep bare gully slopes and recent 
mass movement deposits (0.02 Bq kg -1 ) was represen¬ 
tative of sediment from the gully area and that the 
minor contributions from the cultivated areas and 
areas of grass cover within the gully could be ignored, 
a simple mixing model was used to estimate the relative 
contribution of sediment derived from the gully area 
and the cultivated areas of the rolling plateau to the 
flood deposits. The results provided by the mixing 
model are presented in Table 5. These indicate that 
during the first flood, 18.6% of the sediment was 
derived from the rolling plateau and 81.4% from the 
gully area. During the second event, the equivalent 
values were 26.8% and 73.2%. Taking the two events 
together, the values were 23% from the rolling plateau 
and 77% from the gully area. These results emphasized 
the importance of the gully area and the associated 
gully erosion and mass movement processes as the 
main sediment source within the Zhaojia Gully catch¬ 
ment and clearly demonstrated that any strategy aimed 
at reducing sediment yields from this catchment and 


from similar landscapes in the local area should focus 
on reducing sediment production within the gully area. 
Such measures could include check dams and other 
gully stabilization measures. 

Future Directions 

As demonstrated by the above overview, the use of 
fallout radionuclides in soil erosion and sedimentation 
investigations now has a history that stretches back 
over more than 30 years. Over this period, there have 
been advances in measurement technology and 
whereas early work primarily focused on the use of 
137 Cs, 210 Pb ex and 7 Be are now also quite widely used 
both as an alternative and a complement to 137 Cs. 
Equally, whereas most early work focused on 
documenting patterns and rates of soil erosion and 
establishing chronologies for lake cores, subsequent 
work has expanded the various applications to include 
other sediment sinks, particularly river floodplains and 
wetlands, and sediment source fingerprinting. These 
expanding applications have been promoted by two 
key drivers. The first is the growing recognition of 
both the many environmental problems associated 
with the mobilization and transfer of fine sediment 
within the landscape and the need to document and 
develop an improved understanding of the source, 
mobilization, and transfer of fine sediment, in order 
to develop effective management and control strate¬ 
gies. The second is the essentially unique information 
that can be provided by the use of fallout radionuclides 
as tracers and the need to develop the use of tracers to 
complement more traditional monitoring techniques 
[ 163] . Future developments are likely to expand further 
this field of work and four areas in particular can be 
identified as potential areas for expansion or advances. 


Fallout Radionuclides and the Study of Erosion and Sedimentation. Table 5 The relative contribution of sediment 
from the rolling plateau and gully areas of the Zhaojia Gully catchment during the floods of 1993 


Flood 

Date of flood deposit 

Deposit thickness (cm) 

137 Cs content of sediment 
deposits (Bq kg” 1 ) 

Relative contribution (%) 

Rolling plateau Gully 

2 

August 21,1993 

18.3 

1.06 

26.8 

73.2 

1 

July 21-August 1,1993 

14.7 

0.74 

18.6 

81.4 


Based on Walling et al. [162] 
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These include: firstly, the use of other radionuclides; 
secondly, the further development and improvement of 
measurement techniques; thirdly, greater integration of 
the various applications and approaches; and finally, 
the progressive incorporation of fallout radionuclide 
tracers into sediment-based catchment investigations, 
as an operational rather than as a research tool. Each of 
these future directions will be briefly considered. 

Use of Other Radionuclides 

137 Cs, 210 Pb ex , and 7 Be are now firmly established as the 
key fallout radionuclides for use in sediment tracing 
investigations. Their different half-lives and 
contrasting fallout patterns usefully complement each 
other. However, in recent years, some attention has 
been given to the potential for using Plutonium-239 
and 240 ( 239,240 Pu) as an alternative to 137 Cs [ 164-167] . 
These radionuclides are also a product of bomb testing 
and have been shown to demonstrate very similar 
behavior to 137 Cs in terms of their distribution within 
the soil profile [168], their post-fallout redistribution 
within the landscape and thus their potential for study¬ 
ing soil redistribution, [169, 170], dating sediment 
cores [167, 171, 172], and fingerprinting sediment 
sources [166]. However, some contrasts with 137 Cs in 
terms of their post-fallout redistribution have been 
reported [173] and it has been suggested that 
239,240p u * s to be i ess mo bil e than 137 Cs in sedi¬ 
ment cores [167]. 

The likely advantages of 239,240 Pu over 137 Cs lie 
primarily in their longer half-lives (24,110 and 
6,561 years, respectively) and secondarily in analytical 
requirements. In the first context, the relatively short 
half-life of 137 Cs (30.1 years) means that activities of 
this radionuclide are beginning to decline significantly 
in the environment, since it is now almost 50 years 
since the period of peak fallout around 1963. Where 
fallout fluxes were relatively low, as, for example, in the 
southern hemisphere, this can represent a problem for 
gamma spectrometry analysis. Reduced activities mean 
longer count times and are likely to be coupled with 
reduced measurement precision and increased mini¬ 
mum levels of detection. With their long half-lives, 
239,240p u not f ace p ro blem. In addition, 
239,240p u can proyijg additional chronomarkers asso¬ 
ciated with changes in the Pu isotopic composition of 


fallout during the 1950s and 1960s. In particular, the 
238 Pu/ 239+240 Pu activity ratio shows distinct shifts in 
the early 1950s and the mid-to-late 1960s, providing 
important known-age horizons in southern hemi¬ 
sphere sediments. In the second context, recent 
advances in analytical techniques have provided the 
possibility of analyzing 239 ’ 240 p u by mass spectrometry 
(ICPMS and AMS), thereby providing a means of 
greatly increasing sample throughput, relative to 
137 Cs, where individual samples collected in areas 
with low fallout inventories can require a count time 
of ~24 h or more. A further feature of 239,240 Pu fallout 
is that, unlike 137 Cs, it was not associated with both 
bomb and Chernobyl fallout. It was only associated 
with the former. In some areas of the world affected 
by Chernobyl fallout, 239,240 Pu may therefore offer 
opportunities to distinguish fallout inputs from both 
sources [174] and to exploit their differences in timing 
to obtain additional information on the erosional 
history of a study area. 

Another radionuclide which has attracted attention 
for its use as an alternative to 137 Cs in dating lake cores, 
due to the likelihood of reduced mobility, is 241 Am. 
241 Am is a product of the decay of 241 Pu from bomb 
fallout and therefore has the advantage that activities 
are currently increasing rather than decreasing. 

Development and Improvement of Measurement 
Technology 

It is possible that advances in technology might also 
lead to alternatives to gamma spectrometry for the 
analysis of 137 Cs, 7 Be, and 210 Pb. The potential to 
increase sample throughput would offer considerable 
scope for increasing the size of the area that can be 
investigated and/or the spatial resolution of the results. 
Other areas of measurement technology, where advances 
have begun to occur, include the remote sensing of 
spatial patterns of radionuclide inventories using air¬ 
borne sensors. The successful use of this approach for 
137 Cs has been reported in Northern England and 
Northern Ireland [175, 176]. As yet, the spatial resolu¬ 
tion is limited and the approach is more suited 
to documenting spatial variability of the original fall¬ 
out input rather than post-fallout redistribution. Fur¬ 
thermore, its use may be restricted to areas receiving 
additional Chernobyl fallout where 137 Cs inventories 
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are higher. However, the possibility of collecting 
detailed information on the spatial distribution of 
inventories across an area would clearly introduce 
important new opportunities for expanding the scale 
of sediment redistribution investigations. 

Increased Integration of Radionuclides and 
Approaches 

Two important features of the use of fallout radionu¬ 
clides as tracers in erosion and sedimentation investi¬ 
gations are: firstly, that conjunctive use of different 
radionuclides can provide increased information on 
the functioning of the system under investigation, and 
secondly, that it is possible to use the same tracers to 
document various components of a catchment sedi¬ 
ment budget, for example, soil redistribution rates on 
the slopes; the deposition and storage of fine sediment 
on downstream floodplains; the dynamics of sediment 
mobilization, deposition, and remobilization within 
the channel system; and the relative contribution of 
key sediment sources to the sediment output from the 
catchment. Furthermore, where a long-term sink, such 
as a lake or well-developed floodplain or wetland, exists 
at the outlet of the catchment, it is possible to use the 
same radionuclides to establish a chronology for sedi¬ 
ment cores to assist in interpreting the erosional history 
of the catchment. The potential for conjunctive use of 
several radionuclides is being increasingly recognized, 
and Blake and coworkers [177] provide an interesting 
example of this potential in their study of sediment 
redistribution by erosion after a wildfire at a site in 
Australia. Equally, the potential for integrating work 
on several different components of the sediment bud¬ 
get is provided by Zhang and Walling [178] who 
showed how it is possible to use the information pro¬ 
vided by 137 Cs depth distributions in lake and reservoir 
sediments to provide information on both erosion 
rates in the contributing catchments and the dominant 
sediment sources. 

To date, however, studies involving different com¬ 
ponents of the sediment budget have often been com¬ 
partmentalized and considerable scope exists to 
integrate them more closely to provide a comprehen¬ 
sive assessment of the catchment sediment budget. In 
this context, there will be a need to make use of other 
measurements and approaches to complete the picture, 


but fallout radionuclide measurements can provide the 
key to establishing the sediment budget of a catchment 
[163, 179-181]. As indicated previously, the catchment 
sediment budget represents a key source of information 
for designing and implementing effective sediment 
management strategies for catchments and river basins 
[ 182] . Further work is required to integrate the various 
applications of fallout radionuclides in sediment bud¬ 
get investigations and to develop the comprehensive 
toolbox that will be required to exploit their full 
potential. 

Moving from a Research Tool to an Operational and 
Management Tool 

To date, most of the studies aimed at exploring and 
developing the potential for using fallout radionuclides 
in studies of erosion and sedimentation have been 
curiosity-driven research investigations. In some 
instances, the starting point has been the potential 
application, which has then been demonstrated. In 
others, the starting point has been the need to solve 
a problem or to obtain an improved understanding of 
a particular part of the system, and fallout radionu¬ 
clides have been successfully used as the tool to achieve 
that objective. However, as the comprehensive toolbox 
referred to above is progressively developed, it is to be 
hoped that the wide-ranging potential of the approach 
will gain increasing recognition and that the use of 
fallout radionuclides will become more routine and 
thus a more widely used tool to support the sustainable 
management of the environment. 
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be both inorganic and organic, but this contribution 
refers mostly to inorganic or manufactured fertilizers, 
since these provide most of the nutrients now added to 
soil and crops. 


Glossary 

Essential nutrients There are 16 mineral elements that 
are essential for plant growth. They are referred to 
here by their symbols: nitrogen, N; phosphorus, P; 
potassium, K; calcium, Ca; magnesium, Mg; sulfur, 
S; boron, B; iron, Fe; manganese, Mn; copper, Cu; 
zinc, Zn; molybdenum, Mo; sodium, Na; chlorine, 
Cl; silicon, Si; nickel, Ni. 

Fertilizer efficiency This refers to the additional grain 
or other agricultural product produced per unit of 
additional nutrient applied in fertilizer. A related 
concept is Fertilizer Recovery, which is the addi¬ 
tional mass of a nutrient in the aboveground parts 
of a crop expressed as a proportion of the additional 
nutrient applied in fertilizer. 

Macronutrients N, P, and K. 

Micronutrients Cl, Fe, Mn, B, Zn, Cu, Mo, and Ni. 
Soil microbial processes These include mineraliza¬ 
tion, which is oxidation of organic matter and 
the release of mineral nutrients; immobiliza¬ 
tion, which is the reverse process of incorporating 
mineral nutrients into organic matter; and nitri¬ 
fication, which is conversion of ammonium to 
nitrate. 

Definition of the Subject 

Fertilizers are compounds or mixtures delivered as 
solids, liquids or gases, that supply essential nutrients 
to crops in soluble forms that are convenient and safe to 
handle. Fertilizers may be applied to the soil or directly 
to foliage. All nutrients except N are manufactured by 
concentrating and refining ores extracted from mines. 
N fertilizers are manufactured from ammonia, which is 
synthesized from N 2 and H 2 . Science contributes to 
fertilizer use with improved products and methods to 
increase fertilizer efficiency, profitability of nutrient 
used, and reducing adverse environmental effects. 
Technology contributes to fertilizer use by improving 
the efficiency of manufacture and the complex logistics 
of delivering hundreds of million tons of products to 
farms safely, economically, and on time. Fertilizers can 


Introduction 

Fertilizers are indispensable because nutrient supplies 
from the soil are normally inadequate for the 
high-yielding crops needed to supply food and fiber 
for the growing human population. Fertilizers replen¬ 
ish nutrients removed by previous crops and supple¬ 
ment the supply of nutrients from the soil to a level that 
will produce the farmer’s target yield. Fertilizer may 
also improve the quality of human food, stockfeed, and 
fiber. The nutrients in fertilizer are a nonrenewable 
resource and, if they escape from the field, can cause 
environmental damage. This article discusses the sus¬ 
tainability of fertilizer in relation to these issues. 

Before the invention of fertilizers, the only ways that 
farmers could accumulate nutrients on a field was to 
fallow before growing a crop, or to transfer animal 
manure from grasslands and barns to cropped land, 
either directly or by grazing livestock on grassland by 
day and enclosing them by night in a field prior to 
cropping. An additional source of N has been, and 
still is, the residues of previous legume crops, pastures, 
and green manures. With these natural nutrient 
sources, land is temporarily taken out of crop produc¬ 
tion. Fertilizer allows land to be used for continuous 
production. 

The consumption of the macronutrients in fertil¬ 
izer, N, P and, K has grown rapidly since the middle of 
the twentieth century and has increased the food sup¬ 
ply more rapidly than the growth of the human popu¬ 
lation (Table 1). The increase in fertilizer supply has 
been even more rapid than the food supply. It is diffi¬ 
cult to overstate the importance of fertilizers in sus¬ 
taining humanity: Stewart et al. [ 1 ] concluded from the 
results of experiments that production of 30-50% of 
the world’s food is a result of commercial fertilizers and 
Smil [2] calculated from nutrient balances that that 
nourishment of 40% of the world’s population 
depended on N fertilizers. 

The use of the macronutrients is uneven around the 
world, as shown in Fig. 1. On all continents, fertilizer 
provides relatively more N than P or K. The usage of all 
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Fertilizer Science and Technology. Table 1 Growth in 
world population, cereal production, and consumption of 
nutrients in fertilizer during the second half of the twentieth 
century 



Annual growth (%) 

Human population a 

1.8 

Cereals a 

Maize 

2.7 

Rice 

2.4 

Wheat 

2.3 

Fertilized 

N 

7.1 

P 

4.6 

K 

3.7 


a 1961-2001 [3] 
b 1960-1988 [4] 


three nutrients in Europe and North America most 
closely approaches crop requirement while usage in 
Asia and Africa is relatively high in N. The 
K applications are negligible in Oceania and Africa 
probably because soil levels of K tend to be relatively 
high in dry regions. 

This article discusses the sustainability of fertilizers 
from the perspectives of resources and logistics 
involved in supplying fertilizer to the farm, the on-farm 
effects on productivity, and on-site and off-site envi¬ 
ronmental effects. 

Mining and Manufacture of Fertilizers 

The raw materials for fertilizer manufacture, apart 
from N, are mined from ore deposits. In the century 
before 1920, most fertilizer N was also extracted from 
mines and, before that, from accumulations of nitrate 
in soil where farm animals had been housed for long 
periods. Much of this nitrate was used to manufacture 
gunpowder and other explosives as well as for fertilizer. 
The increasing demand for explosives and N fertilizer 
in the nineteenth century led to brutal mining indus¬ 
tries that extracted guano (vitrified fecal deposits from 
sea birds, mostly on tropical islands) and nitrates, 
a sorry history told by Bown [7]. 


In modern mining industries, the nutrient com¬ 
pounds in the ore are concentrated by chemical or 
physical processes into fertilizers that are stable and 
available for plant uptake. In some cases, two or more 
nutrient elements are combined, by chemical reaction 
or physical mixing, into compound fertilizers. Most 
fertilizers are applied as solids, and single-element 
and compound fertilizers are normally manufactured 
as relatively homogeneous granules, which flow freely, 
minimize water absorption, produce little dust, and 
which can be transferred efficiently between storages 
and applied to the soil using machinery or by hand. 
Most nutrients are available as several different chem¬ 
ical compounds. Examples of these compounds and 
the manufacturing processes are described by Tisdale 
et al. [8]. 

Nitrogen fertilizers are produced from ammonia, 
which is synthesized by the Haber-Bosch process under 
conditions of high temperature and pressure using N 2 
and H 2 [2]. The usual source of H 2 is natural gas, and 
the price of N fertilizer is closely linked to the natural 
gas price (Fig. 2). 

Many highly productive farming systems directly 
inject anhydrous ammonia into the soil. The advantage 
of ammonia is its high N concentration (82%) and 
hence low transport cost. Offsetting this is the need 
for expensive containment vessels, typically made from 
thick steel, and other costs of ensuring safety. The use of 
ammonia is confined to application at or before the 
time of sowing because of toxicity to growing plants. 
Applications of a large N amount at these times can 
cause yield reductions, as discussed below. 

Ammonia is used to produce many forms of 
N fertilizer. The most common is urea, which has the 
advantages that it contains a relatively large percentage 
of N (46% N) and is safe to handle, unlike ammonium 
nitrate (34% N), which can be made to explode, and 
anhydrous ammonia, which is toxic to humans and 
animals at low concentrations in the atmosphere. 
Other common N-containing fertilizers are the solid 
ammonium phosphates (10-20% N) and the solution 
of urea-ammonium nitrate (28-32% N). 

Fertilizer Distribution, Marketing, and Price 

Most fertilizer in developing countries is granular and is 
sold in 40-50 kg bags and broadcast by hand, apart from 
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Fertilizer Science and Technology. Figure 1 

Variation between continents in the proportions of N, P, and K used in fertilizer, and some examples of the proportions 
of these nutrients contained in whole crops (fertilizer data from FAOstats, rice data from Dobermann and Fairhurst [5], 
and wheat data from Bar-Tal et al. [6]) 



Fertilizer Science and Technology. Figure 2 

The price of urea generally follows the price of natural gas, which represents the main production cost [9] 


a small but increasing amount applied by machine at the 
time of sowing. In developed countries, most of the solid 
fertilizer is supplied in bulk and increasing amounts of 
liquid fertilizer are used because of the convenience of 


pumping the product between vessels during transport 
and application. These advantages can compensate for 
the additional cost of transporting the water in which 
the nutrients are dissolved. Most of the advantages of 
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liquid are for on-farm handling, so an efficient system 
is for fertilizers to be delivered as solids from the man¬ 
ufacturer to the reseller who then dissolves the solids 
just before delivery to the farm. The advantages of 
liquid fertilizer (and gaseous anhydrous ammonia) are 
greatest for highly productive regions where farms are 
close to resellers and where it is possible to grow two or 
more crops each year and spread the cost of specialized 
equipment for mixing and applying fertilizer over a large 
crop area. Resellers extend the time of distribution by 
offering discounts for early use, but this can be at the 
expense of reduced fertilizer efficiency. One of the haz¬ 
ards of liquid fertilizer is soil compaction due to the 
mass of water and nutrients contained in equipment at 
the time of sowing or in spray carts for post-sowing 
applications. 

Extensive dryland farming systems in developed 
countries use solid fertilizers where the transport 
costs are high. Solid fertilizer has an advantage for 
topdressing because granules can typically be mechan¬ 
ically thrown up to 10-15 m on each side of an imple¬ 
ment, which is consistent with the 10-15-m width of 
seeders and 20-30-m width of boom sprays used for 
controlled traffic. The maximum spreading distance of 
granules may impose limitations as seeders and spray 
booms continue to get wider. 

Some of the largest financial and environmental 
costs of fertilizer are related to transport. Products 
with low concentrations of nutrients are necessarily 
expensive to transport and their use is confined to 
specialized applications, for example calcium nitrate, 
which contains only 16% N, for some vegetables. Dur¬ 
ing the second half of the twentieth century, there has 
been a general increase in nutrient concentrations lead¬ 
ing to reduced cost of transport to the farm and han¬ 
dling on the farm. Nevertheless, the weight of fertilizer 
means that transport is expensive and low-cost trans¬ 
port methods such as ship, barge, and rail are preferable 
to road transport. Transport costs are minimized when 
fertilizer is back-loaded on vehicles carrying agricul¬ 
tural produce to cities. Transport makes up a large part 
of the on-farm cost of fertilizer in rural areas of sub- 
Saharan Africa and presents a major obstacle to 
increasing crop yields [10]. 

The transport cost is even greater for manures and 
compost than they are for inorganic fertilizer. Dry animal 
manures typically contain less than 2% N and 0.4% P, so 


they cannot be economically carried far from the source. 
Instead, manures tend to be applied in a radius up to tens 
of kilometers. Where there is a large source of manure, 
such as a feedlot or large dairy, the available nutrients can 
overload the capacity for crop uptake, so that nutrients 
may accumulate in the soil within a few years. Where the 
manure contains potentially toxic material such as Cu 
contained in pig manure, nutrient loading of the soil 
needs to be closely monitored. 

Fertilizer represents the main demand for N, P, and 
K, but for the micronutrients, usage for fertilizer man¬ 
ufacture is much less than for manufacturing industry. 
In the case of Zn, annual world production is about 
20 million tons, but the amount of Zn contained in all 
cereal grains is less than 0.1 million tons per year. 

The use of compound fertilizers has advantages in 
ease of handling and may be cheaper than single¬ 
element fertilizers. For example, the nutrients in 
ammonium phosphates are generally cheaper to the 
farmer than when they are purchased separately, for 
example as urea and triple superphosphate. In such 
situations, compound fertilizers can be profitable for 
the farmer, even if one of the nutrients is not particu¬ 
larly deficient. However with some NPK blends, there 
may be situations where crops do not respond to all 
nutrients contained in the fertilizer, and where the 
compound fertilizer is expensive, the whole product 
may be unprofitable to use. Such a situation applied 
to many farms in the Philippines where yield of rice did 
not respond to either P or K in a NPK compound 
fertilizer [11]. In this situation, application of the 
NPK was unprofitable on one third of farms and there 
was no simple way of determining which fields were 
deficient in which nutrient element. In such situations, 
the simplest solution is to make single-element 
fertilizers available so that strip trials conducted by 
farmers can show which nutrients give profitable 
responses [12]. 

The main fertilizers such as urea, MAP 
(monoammonium phosphate), and DAP (diammonium 
phosphate) can be regarded as generic and unspecialized 
commodities that are traded freely. They are sold by 
manufacturers to distributors on world markets that are 
generally open and transparent. An insight to the whole¬ 
sale market is available online [13]. There is little oppor¬ 
tunity for manufacturers or distributors to compete other 
than through price and service. Intense price competition 
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does not support the extension services offered by many 
fertilizer companies until the 1980s or 1990s. 

Marketing is generally in the hands of agribusiness 
employees and media campaigns. Increasingly, advice 
about fertilizers is supplied to farmers at or near the 
point of sale by fee-for-service agronomists employed 
by fertilizer vendors. This represents a conflict of inter¬ 
est between the need for agribusiness to maximize sales 
and minimize carryover of fertilizer inventory, and the 
need of farmers to apply the fertilizer at an optimum 
amount and timing. Independent advice to farmers is 
important since fertilizer usually represents the single 
greatest cost of crop production. 

Since generic fertilizers do not provide large profits, 
manufacturers or distributors aim to develop unique 
products that are easy to use or contain additives that 
increase fertilizer-use efficiency. Examples are additives to 
N fertilizers that inhibit urea hydrolysis and so limit the 
potential for ammonia volatilization, or nitrification 
inhibitors that restrict the production of nitrate and 
hence loss by leaching and denitrification [ 14] . The cost 
of these additives may not necessarily be justified by 


additional profit, and they need to be assessed on 
a case-by-case basis [15]. Some urease and nitrification 
inhibitors are powerful biocides and have effects 
other than on N relations, for example, the nitrification 
inhibitor nitrapyrin has fungicidal activity, so any yield 
benefits may not be due only to effects on N supply to 
crops [16]. Slowing the release of nutrients from fertil¬ 
izer by coating granules with a polymer is another 
possible means of maintaining adequate nutrient levels 
near the roots [17]. The cost-effectiveness of these 
products also needs to be assessed on a case-by-case 
basis. The most promising markets for fertilizer addi¬ 
tives are where subsidies are offered to minimize 
N release into the environment. 

World fertilizer prices fluctuate, depending on var¬ 
iations in supply and demand. The fluctuations for 
particular products can be massive, for example, the 
urea price spike in the mid-1990s when Chinese fertil¬ 
izer companies suddenly increased urea imports 
(Fig. 2). The cost of fertilizers generally can also fluc¬ 
tuate strongly, as in 2008 when fertilizer price increased 
in tandem with a rise in world grain prices (Fig. 3). 




Fertilizer Science and Technology. Figure 3 

(a) Farm-gate cost of fertilizer N and price of wheat, both expressed in $A and (b) the cost to price ratio for N and wheat 
grain (updated from Angus [18]). Costs and prices are expressed in Australian dollars ($A) since these represent world 
prices undistorted by subsidies 
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It is clear from Fig. 3 that the price of grain and cost 
of fertilizer tend to readjust to maintain a reasonably 
constant cost:price ratio of about 6, even when grain 
price or fertilizer cost are disrupted by an external 
shock. 

The price of fertilizer in many markets has been 
subsidized by governments, generally leading to fertilizer 
overuse, inefficient methods of application and, in some 
cases, excess agricultural production that is dumped on 
world markets. In some countries, the subsidy is through 
supply of natural gas at less than world price for ammonia 
production. Fertilizer subsidies have similar effects to 
subsidies on the price of agricultural production, in that 
they lead to overproduction because the optimum appli¬ 
cation rate of fertilizer is affected by the ratio of product 
price to fertilizer cost. Subsidies on agricultural products 
and fertilizers have long been criticized because they 
depress the price of agricultural products in world mar¬ 
kets and so reduce the incomes of unsubsidized pro¬ 
ducers. Criticism is also justified because the 
overfertilization promoted by subsidies is a major 
cause of environmental damage [19, 20]. 


Fertilizer Effects on Agricultural Productivity 

In their textbook on plant nutrition, Mengel and Kirkby 
[21] identify 16 mineral elements as essential for plants. 
Three of these, Na, Cl, and Si, are normally present in 
plant tissue at concentrations greater than needed for 
optimum growth and Ni is rarely or never included in 
fertilizers. Table 2 lists the remaining 12. A further two, 
cobalt and selenium, are not essential for plants but are 
essential animal (and human) nutrients and can be 
supplied in fertilizer or administered directly to farm 
animals. The other essential animal and human nutri¬ 
ent, iodine, is not normally supplied through fertilizer. 

The concentration of the nutrients in plant tissue 
varies enormously; for example, N is 100,000 times 
more concentrated than Mo in cereal grain. The poten¬ 
tial yield response to a nutrient, when it is deficient, can 
be estimated from the inverse of the nutrient concen¬ 
tration. So, for example, if the N concentration in grain 
is 0.02, as shown in Table 2, a first approximation of 
potential grain response is 50 kg grain per kg of addi¬ 
tional N, assuming that the N concentration of the 


Fertilizer Science and Technology. Table 2 Typical concentrations of nutrients in cereal grain, C, and the proportion 
of the aboveground nutrient in the grain [21], called the harvest index for the nutrient (HI). The maximum grain response 
to the nutrient is given by HI/C. The maximum net financial return is given by [G(HI/C) - F]/F where F is the cost of 
nutrient in fertilizer and G is grain price, assumed here to be $US 0.2/kg 


Nutrient 

Concentration in 
grain (C) 

Proportion in 
grain (HI) 

Maximum fertilizer efficiency 
(HI/C) (kg grain/kg fertilizer) 

Nutrient cost 
($US/kg) 

Maximum net 
returns ($/$) 

N 

0.02 

0.7 

35 

1.0 

6 

P 

0.003 

0.7 

233 

5.0 

9 

K 

0.004 

0.2 

50 

0.8 

12 

Ca 

0.005 

0.7 

140 

0.2 

140 

Mg 

0.003 

0.5 

167 

1.1 

30 

S 

0.003 

0.4 

133 

0.2 

130 

B 

0.0001 

0.1 

1,000 

12 

15 

Fe 

0.0002 

0.3 

1,500 

3 

100 

Mn 

0.00002 

0.5 

25,000 

5 

1,000 

Cu 

0.00002 

0.5 

25,000 

5 

1,000 

Zn 

0.00002 

0.5 

25,000 

5 

1,000 

Mo 

0.0000002 

0.5 

25,000,000 

35 

150,000 
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grain is unchanged by the fertilizer. In reality, not all the 
fertilizer N is present in the grain. In the case of wheat, 
0.7 is a reasonable estimate of the proportion of above¬ 
ground N contained in mature grain. A better approx¬ 
imation of the maximum grain response to applied N is 
then 50 x 0.7 = 35 kg grain per kg N. Table 2 shows the 
equivalent calculation for all nutrients. This approach 
applies only to the first small application of a nutrient 
and does not account for the diminishing returns of 
yield to additional applications. 

Table 2 also presents estimates of the maximum net 
financial return calculated from the maximum grain 
response, the cost of nutrients in fertilizer, and the 
price of grain. Note again that these are the maximum 
responses and returns, and real returns are normally 
much less because not all the fertilizer is recovered by 
crops. The costs and prices are necessarily assumptions 
and will vary greatly between markets and between 
years. However, the differences between nutrients are 
so great that even with approximations they serve to 
show some important points. 

The maximum responses of additional grain per 
unit of nutrient applied in fertilizer in Table 2 can 
serve as benchmarks. When a crop is growing on soil 
that is believed to be deficient in a nutrient, it should 
give a yield response comparable with these bench¬ 
marks. If it does not it is likely that the nutrient is not 
seriously deficient or there is a limitation other than the 
deficiency that is limiting efficient utilization of the 
nutrient. 

The maximum financial crop responses to the 
micronutrients such as Zn, Cu, and Mo are so large 
that they are normally applied if there is even 
a suspicion of deficiency. There are many farming sys¬ 
tems where micronutrients still give major grain 
responses, for example in central Turkey, where wide¬ 
spread Zn deficiency was identified in the 1990s [22]. 
Continued vigilance is needed in all farming systems in 
case micronutrients become deficient. 

Even for N and P fertilizer, the maximum financial 
responses are large enough to provide a profit even 
when the efficiency of recovery is low. In these exam¬ 
ples crop recovery of only one sixth of the applied N or 
one ninth of the applied P gives a break-even return on 
cost of the fertilizer. 

The grain-yield responses to fertilizers are usually 
much less than these maxima and a great deal of 


research goes into increasing fertilizer efficiency. 
Where deficiencies are severe and there is only one 
nutrient lacking, visual symptoms can be compared 
with standard illustrations [23]. For less acute deficien¬ 
cies, a traditional method is to calibrate yield against 
a soil test and identify the critical nutrient concentra¬ 
tion that gives near-maximum yield [8]. The process of 
soil testing consists of sampling the soil in a field with 
many cores, combining the samples, and analyzing for 
nutrient concentration. 

Soil tests are not the only methods for determining 
nutrient status and Table 3 shows examples of methods 
to estimate N status, including tests related to the 
nutrient status of vegetative plants. Tests of plant tissue 
are generally more reliable than soil tests because the 
volume of soil sampled by plant roots is normally more 
than the soil volume explored by coring [24, 25]. 

There are limitations to basing optimum fertilizer 
application only on tests of soil or plant nutrient status. 
The approach is suitable for a cropping system in which 
prices and yields do not vary greatly from year to year, 
but is less useful where the variation is large. The 
optimum fertilizer application varies with the ratio of 
grain price to fertilizer cost, and the optimum fertilizer 
rate is high when the ratio of grain price is high relative 
to fertilizer cost. 

The optimum fertilizer rate also depends on yield 
potential, and higher rates are justified by increasing 
yield potential. Soil and plant tests emphasize the 


Fertilizer Science and Technology. Table 3 Tests of 
N status 


Pre-sowing tests 

Tests during crop growth 

Field history 

Tissue N concentration 

Total N in the topsoil 

Tissue nitrate concentration 

Mineral N in the topsoil 

Sap nitrate concentration 

N released during soil 
incubation 

Shoot density 

Previous grain protein 

Leaf chlorophyll 
concentration 


Leaf color compared with 
standards 


Lower leaf senescence 


Strip trials 
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Fertilizer Science and Technology. Table 4 Example of 
a nutrient budget for N fertilizer applied to wheat 


Crop N demand 

Target yield 

4 t/ha 

Target grain protein 

12% 

N in grain 3 

84 kg/ha 

N in aboveground crop b 

120 kg/ha 

Soil N supply 

Soil mineral N at sowing 0 

50 kg/ha 

N mineralization during crop growth 

80 kg/ha 

N available for crop uptake (50%) d 

65 kg/ha 

Fertilizer N requirement 

Additional N required in the crop 6 

55 kg/ha 

Fertilizer N required 01 

110 kg/ha 


a Grain N is the product of yield and grain N, assuming that 1 kg of 

N = 5.7 kg of grain protein 

b Assuming grain N is 70% of aboveground N 

c Soil mineral N sampled to a depth of at least 60 cm 

d Assuming 50% of soil and fertilizer mineral N is available for crop 

uptake 

difference between crop demand and soil supply 

supply of nutrients but it is important to place equal 
emphasis on crop nutrient demand. Table 4 shows an 
example of a nutrient budget, based on a method devel¬ 
oped by Myers [26], to illustrate the importance of 
N supply and demand. 

When the quality of the grain is affected by fertilizer, it 
is necessary to consider the responses of both yield and 
quality and the ratios of fertilizer cost to the grain price 
and quality premiums. Angus [27] presents a method to 
estimate the economic optimum fertilizer for wheat in 
relation to responses by grain yield and protein. 

A well-known but generally erroneous notion about 
nutrients is the Law of the Minimum, which claims that 
only one nutrient can limit yield of a particular crop. 
Where one nutrient is drastically deficient, the Law of 
the Minimum is a fairly good rule of thumb, for example 
when a micronutrient such as Zn gives a spectacular yield 
response. More generally, this Law is deceptive, and crops 
and cropping systems tend to regulate the quantity of 
nutrients in the soil by luxury extraction of those that are 
adequate or surplus, and poor extraction of those that are 
deficient. The different rates of extraction continue until 


many nutrients become co-limiting and are required for 
the yield to reach the biological potential [28] . 

The weakness of the Law of the Minimum can be 
seen from the many experiments where fertilization 
with any one of several nutrients can increase yield 
(e.g., [11]). Fertilizer management therefore usually 
needs to consider more than one nutrient. Where sev¬ 
eral nutrients simultaneously limit yield, the combined 
effect may be additive, meaning that the yield response 
of the combined nutrients is the same as the sum of yield 
responses of the nutrients applied singly. In other situ¬ 
ations there may be a positive (or negative) interaction, 
meaning that the effect of the nutrients when applied 
together is greater (or less) than the sum of the effects of 
the individual nutrients. As with many aspects of fertil¬ 
izer management, there is no alternative to conducting 
experiments on farms to quantify crop responses to 
nutrients and their interactions. This rest of this section 
discusses the macronutrients, N, P, and K, and, as exam¬ 
ples of minor and micronutrients, S and Zn. 

Nitrogen 

Nitrogen is used in greater quantity than other nutri¬ 
ents but the financial returns are often less (Table 1). 
Nitrogen fertilizer has a reputation for low efficiency 
and many studies show that a crop typically recovers 
about half the fertilizer N applied, but the proportion 
can vary from zero to almost complete recovery [24]. 
The reasons for incomplete recovery are numerous but 
often the soil can supply as much N as the crop can take 
up, and any additional N remains in the soil where it 
maybe lost in several processes. Nitrogen losses include 
leached nitrate, gaseous ammonia volatilization from 
urea and ammonia-based fertilizer, and nitrous oxide 
(N 2 0) and N 2 from denitrification. Urea is at particular 
risk of loss, mainly because the soil becomes alkaline 
around a dissolving granule, so that any ammonium 
formed is rapidly converted to ammonia, which 
volatilizes. 

Another cause of low crop recovery is immobilization 
of fertilizer N by soil microbes. This process is not neces¬ 
sarily a loss, since the N can be present in the soil organic 
matter (SOM) for one or more seasons, during which it is 
normally mineralized and taken up by later crops. 

Nitrogen is unusual among the nutrients in that 
excess application often causes a reduction in crop 
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yield. In wet and high-yielding environments the rea¬ 
sons for yield reductions are lodging, because the crop 
becomes tall and top-heavy, and because of disease in 
thick foliage. In dry conditions, excess N stimulates 
growth, which can lead to exhaustion of soil water 
before maturity. Another and often more important 
reason is that high N levels reduce the concentration 
of water-soluble carbohydrates in the stems, which is 
an important source of assimilates for grain growth 
[29]. The time of application of N fertilizer is impor¬ 
tant in methods to avoid losses. Responses to topdress¬ 
ing N on wheat during the stem elongation phase are 
comparable to the responses to the same amount of 
N applied at sowing in both highly productive envi¬ 
ronments in Europe [30] and in water-limited environ¬ 
ments in Australia [31]. Fertilizer N applied at sowing 
promotes rapid seedling growth, which leads to more 
risk of lodging, foliar disease, and haying-off than the 
same amount of N applied at stem elongation or later. 
Even when growing rapidly, small plants cannot take up 
large amounts of N, so mineral N in the soil at the time 
of sowing is at risk of loss from leaching and 
denitrification. 

Mid-season application of N is effective because it 
coincides with the period of maximum crop uptake 
and so closely matches supply with demand. Applica¬ 
tion of N during or after the stem elongation phase in 
cereals usually increases grain protein more than appli¬ 
cation at sowing. 

The N losses from irrigated rice can be particularly 
high. De Datta [32] reported that the average recovery 
of N fertilizer in rice experiments was 35-40%. Exten¬ 
sive research showed that the main pathways for N loss 
from lowland rice are ammonia volatilization and deni¬ 
trification when urea is broadcast onto wet soil or 
shallow water. This research did not show ways to 
minimize losses and Fujisaka [33] suggested that 
research should focus more on methods to increase 
N efficiency rather than quantify the loss pathways. 
Subsequent agronomic research showed that with suit¬ 
able methods of application the recovery of N, supplied 
as urea, can be as high as 75% for tropical rice [34] and 
90% for temperate rice at a commercial scale [24]. 

The most efficient ways to increase N use efficiency 
(NUE) are different in each region and with each 
cropping system. Practices that are often effective are 
injecting fertilizer into the soil rather than broadcasting 


it on the surface and delaying application of some or all 
of the fertilizer so that the supply of N is well synchro¬ 
nized with the crop demand. Other ways to increase 
NUE are to ensure that other factors are not limiting. 
These limitations can include deficiency of other nutri¬ 
ents, weeds, herbicide damage, and root disease. An 
example of the importance of overcoming such limita¬ 
tions is apparent in a 40% increase in dryland wheat 
yield in Australia during the 1990s, which coincided 
with the adoption of canola break crops and higher 
applications of N fertilizer. Angus [18] suggested that 
break crops controlled widespread root disease and 
provided suitable conditions for a large yield response 
to N fertilizer. 

There have also been reports of improvements of 
NUE in a single season by classical plant breeding [35] 
or modification of single genes [36]. Genetic improve¬ 
ments in N use efficiency are promising but it is not yet 
clear whether they are consistent across environments 
and seasons, and whether they are as cost effective as 
the established and reliable improvements in crop 
management. 

There is a widespread and recurring problem of 
farmers applying more than the economically opti¬ 
mum rate of N fertilizer. The problem was apparent 
in Europe and the USA from at least the 1970s [20] and 
the same pattern of overuse emerged in many other 
countries, including China since the 1980s [37]. 
N fertilizer applied in excess of crop requirements is 
sometimes called “insurance nitrogen” [38], implying 
that farmers are prepared to pay for the excess provided 
that the yield is maximized. Another explanation lies in 
the concept of “farming styles” of van der Ploeg [39], 
which interprets farming practices in relation to 
farmers’ goals. Farmers who want to boast of the 
highest district yields apply heavy inputs with little 
regard for input costs and net profit. These competitive 
individuals are uninterested in the profit-maximizing 
paradigm of agricultural science. To reverse the pattern 
of fertilizer overuse, it is important to redirect their 
competitiveness to efficiency rather than yield. 

The reason that overfertilization is more common 
with N than other nutrients is not clear, but may be 
related to the rapid and visible response by crops. 
Within 5-10 days of N application, crops usually 
become greener and taller, which gratifies farmers, at 
least when they first observe the response. To this 
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extent, N fertilizer provides its own advertising. The 
greener and taller fertilized crops may or may not yield 
more than those that receive no additional fertilizer. 
Reasons for a lack of yield response may be that the real 
yield limitation is not N but some other limitation such 
as water deficit, disease, deficiency of another nutrient, 
frost, weeds, or herbicide damage. Since the 1990s, the 
application rates of N fertilizer have been static or 
falling in Europe and North America but crop yields 
have continued to rise [40]. Apparently, farmers and 
advisers in these regions are learning the lesson that the 
previous rates were excessive and the time of applica¬ 
tion was too early. 

Phosphorus 

Soil contains P as both soluble and insoluble inorganic 
phosphates and in organic forms including phytates, 
which make up more than 50% of the soil P. Fertilizer 
P adds to the soluble pool that can be taken up by crops, 
but the longer it remains in the soil the more is precip¬ 
itated as insoluble iron and aluminum compounds in 
acid soil, or as insoluble calcium compounds in alka¬ 
line soil. Soils vary in the speed with which they pre¬ 
cipitate soluble P, so soil tests are needed to measure 
plant-available P and the optimum rate of P fertilizer 
[8]. Tests of soil P and other nutrients based on ion 
exchange resins often give closer relations with plant 
uptake than other extraction methods [41]. Plant 
P tests are not as useful as plant N tests because P is 
applied at or before sowing. 

Fertilizer management is designed to maximize and 
prolong P availability to the crop. The most powerful 
method is to include all the P in a band near the seed so 
as to maximize plant access before the P becomes 
unavailable due to precipitation. As with N fertilization, 
it is important to supply only enough for the crop since 
some of the excess P is likely to be precipitated. This 
applies whether the fertilizer is in the solid or liquid 
form. Fiquid forms of P fertilizer are relatively more 
effective than solid forms on alkaline soils because the 
precipitated P is still partly available [42]. 

Most P in the soil is attached to clay particles and is 
not readily leached from any but the sandiest soils. 
Because most soil P is close to the surface it is 
unavailable when the topsoil dries out, so fertilizer 
injection below the surface can improve the efficiency 


of uptake in dry environments [43]. Because of the 
advantages of banding and deep placement, all the 
P fertilizer needs to be applied at the time of sowing. 

Potassium 

Crops require a large supply of K but remove little in 
grain. Most of the K taken up by plants remains in the 
straw and returns to the soil when the straw decom¬ 
poses or is burnt, provided the ash that remains after 
burning is not blown away in the wind. Once in the soil, 
K is not readily leached because of its positive charge 
and tends to be used by crops with greater efficiency 
than N or P. An exception is on soils that contain a large 
proportion of 2:1 clay minerals, where K is strongly 
fixed and becomes unavailable to roots. 

Other Nutrients 

The other nine nutrients commonly applied as fertil¬ 
izers are not all discussed individually and the reader is 
referred to nutrient aspects by Mengel and Kirkby [21] 
and fertilizer aspects by Tisdale et al. [8]. This section 
discusses S and Zn because of their increasing 
importance. 

Sulfur is present in the soil as sulfate, which as an 
anion is subject to leaching. Until the later part of the 
twentieth century, it was inadvertently applied in many 
regions. One form was in superphosphate, which was 
used for its P content (9%) but supplied S (11%) as 
well. The “high analysis” fertilizers, which largely 
replaced superphosphate, contain relatively small 
amounts of S. Another source of S for plants was 
atmospheric deposition as oxides of S from smoke¬ 
stacks, particularly on smelters of metal sulfides. This 
source of S is also decreasing because smokestacks are 
being fitted with “scrubbers” to reduce air pollution. 
Fertilizer S will be increasingly needed to replace these 
sources. 

Zinc is important for not only increasing crop 
growth, but also as an essential human nutrient that 
is becoming seriously deficient. Zinc deficiency is wide¬ 
spread in soils and plants, particularly in west and 
south Asia [44], and methods and products are avail¬ 
able in some countries to correct Zn deficiency [23]. 
Some, but not enough, compound fertilizers in devel¬ 
oped countries include Zn but in many developing 
countries, Zn is available only as zinc sulfate. 
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Sometimes this expensive compound is diluted by 
unscrupulous retailers with salts that look similar, 
such as ammonium sulfate, so the product is an inef¬ 
fective source of Zn (R.J. Buresh, 2011, personal 
communication). 

Much of the Zn content of plants is transferred 
from the soil to plant roots by arbuscular mycorrhizal 
fungi (AMF). The large surface area of AMF filaments 
increases the ability of plants to explore the soil for 
immobile nutrients such as Zn. AMF are highly effec¬ 
tive in soils with low nutrient status, but the modern 
farming practices of high levels of P fertilizer and grow¬ 
ing brassica break crops tends to reduce AMF activity 
and increase the need for Zn fertilizer. Breeding plants 
for increased Zn uptake from soil can be effective in the 
short term but without supply of Zn fertilizer will 
exhaust soil reserves and hasten the onset of severe 
deficiency [45]. 

Fertilizer Recommendations and Decision 
Support 

Methods for prescribing the fertilizer amount evolved 
from blanket recommendation provided by advisers, 
meaning the same practice irrespective of fields or 
seasons. Soil tests and nutrient budgets, as described 
earlier, give more differentiated prognoses of response 
can be used to develop “rules of thumb,” which are 
usually accepted by busy farmers. Computer-based 
decision support systems can take account of many 
factors that are known to affect nutrient response but 
are not as widely adopted as simple systems and rules of 
thumb. McCown [46] concluded that adoption of deci¬ 
sion support systems had not met the promise they 
offered. 

Fertilizers and Precision Agriculture 

Precision agriculture is a set of principles that has 
developed since the 1990s, aimed at managing crops 
at a scale of several meters rather than as a whole field. 
It relies on a global positioning system (GPS) that 
estimates location with precision ranging from 4 to 
0.02 m, spatial data about crop production such as 
maps from a yield monitor, biomass images from 
a satellite or aircraft, or spatial data about soil condi¬ 
tions such as electrical conductivity [47]. Fertilizer can 


be managed by precision agriculture by varying the 
application rate according to the demand by the crop 
and/or the supply from the soil in different zones in 
a field. The farmer or adviser first prepares a prescrip¬ 
tion specifying the fertilizer rate for each zone and 
programs this in a computer carried in a tractor. This 
computer continuously determines where the tractor is 
located relative to the zones in the field, based on 
signals from the GPS receiver, and controls an actuator 
on the fertilizer implement that delivers the appropri¬ 
ate rate of fertilizer in each zone. 

Yield maps provide useful data for deciding on 
fertilizer strategies and tactics. If one part of a field is 
consistently low yielding and it has previously received 
the same amount of fertilizer as the rest of the field, 
then it is likely that soil nutrient levels have accumu¬ 
lated. This pattern is common in the case of P fertilizer, 
and provides evidence to reduce the application rate on 
low-yielding zones. Variable applications of N fertilizer 
are a common use of yield maps. An example is in 
response to natural variation in soil organic matter, 
with additional N supplied to low-fertility zones [48]. 
Another example is in response to “unnatural” soil 
variation due to land leveling, where “cuts” expose 
subsoil containing low nutrient levels and “fills” consist 
of additional amounts of topsoil that is relatively fertile. 

In these examples, additional fertilizer is designed 
to compensate for low soil nutrient status. If yield 
variation is due to soil properties other than nutrient 
status, then it may be inappropriate to compensate for 
low yield with more fertilizer and the best strategy is the 
exact opposite, that is, to apply more fertilizer to zones 
that consistently give the highest yields. An example of 
this situation is when spatial yield variation is due to 
the differences in the available soil water supply. Angus 
et al. [49] investigated yield and crop response to 
N across fields that varied in subsoil constraints of 
salinity, sodicity, and high concentration of boron. 
They showed that wheat yield and the crop response 
to N fertilizer were greatest on parts of a field with the 
least subsoil constraints, and yield responses to applied 
N were least on zones with the most severe constraints. 
Even in the absence of subsoil constraints, the soil 
water-holding capacity can control yield of dryland 
crops, and the highest rates of N fertilizer should be 
applied to zones with large water-holding capacity 
[50]. Spatial soil variation is often best measured by 
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electromagnetic induction, which directly measures 
apparent electrical conductivity, which is a proxy for 
other properties [51]. 

Where the response of a crop to fertilizer depends 
on its N status rather than other soil conditions, an 
optical sensor can provide useful information about 
a static or on-the-go estimate of crop N status. Optical 
systems using infrared sensors, such as the Greenseeker 
[52] and N-Sensor [53], can estimate N status of crops 
at a scale of about 1 m 2 while mounted on a tractor or 
fertilizer distributor, and can regulate the fertilizer rate 
on-the-go. A simple system using a domestic digital 
camera can provide equivalent information for a single 
scene [54]. 

The previous examples applied when the spatial 
yield patterns are consistent between years. A more 
complex approach is needed when the fertilizer 
response is inconsistent between zones and between 
years. For example in a dry season, N fertilizer may 
increase yield in low-lying, wet zones of a field but 
reduce yield in hilly, dry zones. Conversely, in a wet 
season, N fertilizer may be most effective in the hilly 
zones. The best response to this situation is not clear. 
One possibility is to examine yield maps for several 
years and manage fertilizer for an average season. 
Another is to delay the decision to apply fertilizer 
until the seasonal pattern is apparent. It may also be 
possible to minimize yield variation due to water 
movement by land-forming and drainage. 

Fertilizers in Organic Farming 

Many consumers are prepared to pay premiums for 
organically grown produce and a small proportion of 
farmers want to supply food produced by one of the 
organic farming systems. For certification of organic pro¬ 
duce, the several organic farming systems generally 
require nutrients to be applied as manure and 
unprocessed minerals but not as processed fertilizer. 
Application of sufficient manure can provide optimum 
crop nutrition and until recently it was assumed that 
there were no disadvantages to the nutrient relations of 
organic farming. Research compiled by Kirchmann and 
Bergstrom [55] shows that this assumption is invalid 
because there is consistently more N leaching under 
organic than conventional systems, apparently because 
the soil N supply is not well synchronized with crop 


N demand. These results suggest that the nutrient rela¬ 
tions of organic farms may be less sustainable than con¬ 
ventional farms. Many governments subsidize organic 
farming systems and thus support unsustainable nutrient 
management. 

In another comparison of organic and conventional 
farming systems, Fagerberg et al. [56] showed that 
nutrient levels of the organic part of a split farm 
remained adequate for several years after conversion 
from conventional, but that later some nutrients 
become depleted and productivity fell. The likely rea¬ 
son was that these nutrients had accumulated during 
several decades of fertilizer application before half the 
farm was converted to organic farming. 

Residual nutrients from previous fertilizer applica¬ 
tion are not the only source of N in organic farming. 
In parts of north America and western Europe, the 
deposition of mineral N from the atmosphere con¬ 
tributes a large part of crop requirement [20]. The 
sources are ammonia emitted from manure in intensive 
animal industries and nitrogen oxides emitted from 
vehicles. Both sources of N originate from industrial 
operations, so it is difficult to understand how any 
farming conducted in such regions can be designated 
as organic. 

Some strands of organic farming propose applica¬ 
tion of plant extracts and microbial preparations to the 
soil to promote crop growth, and such products have 
spread in the decades since there was regulation of 
agricultural chemicals. Reports of the effectiveness of 
some of these products were reviewed by Edmeades 
[57], who concluded that they gave no significant 
yield benefit when used as recommended. A manufac¬ 
turer of one such product objected to Dr. Edmeades’ 
comments and took legal action against him in New 
Zealand courts. Dr. Edmeades won the case but effec¬ 
tively lost his scientific career and wasted a year 
enmeshed in the legal system [58]. It is not clear how 
reputable science should deal with claims that “alter¬ 
native” products provide production and environmen¬ 
tal benefits out of all proportion to their nutrient 
composition. Even if one such product is proved to be 
ineffective and removed from the market, it is likely 
that others will take its place. A solution may be to 
strengthen consumer protection laws so that manufac¬ 
turers can be penalized for making unsubstantiated 
claims. 
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Environmental Benefits of Fertilizers 

Fertilizers provide environment benefit by increasing 
soil organic matter in nutrient-deficient soils. For 
example, superphosphate provided P and S for vast 
areas of deficient soil in Australia, which provided 
that trigger for biological nitrogen fixation by pasture 
legumes [59]. Even more spectacular increases in soil 
organic matter came when micronutrient deficiencies 
were corrected with fertilizers in the Western Australian 
sandplain. These are the only positive environmental 
effects of fertilizer and other effects are neutral or 
negative. 

Pollution of Surface Water and Groundwater 

When there is nitrate in the soil and the water supply 
from precipitation and irrigation exceeds evaporation, 
the nitrate will be leached and eventually enter the 
groundwater. Levels of groundwater nitrate began to 
increase in the second half of the twentieth century, 
most notably in Europe and North America, and have 
stubbornly remained high. For example, Johansson 
and Gustafson [60] reported that after N-fertilizer 
application to an arable field in southern Sweden was 
discontinued, there was a delay of 20 years before the 
concentration of subsoil nitrate decreased. 

The best-known environmental damage from nutri¬ 
ents is water pollution in lakes, rivers, and parts of the 
ocean. The pollution causes hypoxia, algal blooms, and 
death of pelagic fish in many water bodies, the largest of 
which are the Gulf of Mexico [61], the Baltic Sea [62], 
and the Yellow Sea [63] . The delay in affecting nutrient 
levels in such water bodies is probably longer than for 
groundwater, as shown by the constant levels of 
dissolved N in the Baltic Sea 10 years after N fertilizer 
usage almost stopped in the eastern Baltic countries 
after the breakup of the Soviet bloc [62]. 

Both N and P from fertilizer contribute to pollution 
of large water bodies but even if N from fertilizer were 
not present, blue-green algae would be able to obtain 
their N requirements from biological fixation. Other 
evidence for the importance of P in water pollution is 
that the size of the “dead zone” in the Gulf of Mexico is 
more closely related to the discharge of P than of 
N [64]. The source of nonpoint P pollution may not 
be confined to fertilizer, and streambanks disturbed by 
humans and grazing animals have the potential to 


release soluble P into water bodies. A promising 
method to reduce phosphate release into streams is to 
exclude grazing animals from riparian strips. 

Systems to limit nutrient pollution of water have 
included fertilizer taxes and quantitative limits of 
nutrient loading from both fertilizer and manure. 
N fertilizer was taxed in Sweden for many years but 
there was little change in the rate of N application to 
crops. In the Netherlands, a system of nutrient 
accounting of inputs and outputs limits the nutrient 
balance of farms to reduce nutrient movement into 
groundwater and streams [65]. A promising method 
is to define sensitive zones, based on infiltration rate or 
proximity to streams, where fertilizers are banned, lim¬ 
ited, or taxed [66]. 

Intensive research continues on the nitrogen cascade, 
a phrase that recognizes a chain of intended and 
unintended consequences when reactive N moves on 
land and in the air and water. An example of coordinated 
research in western Europe is given by Sutton et al. [67] . 

Greenhouse Gas and Ammonia Emissions 

Nitrous oxide (N 2 0) and N 2 are released from soil into 
the atmosphere by the processes of nitrification [68] 
and denitrification [69]. N 2 0 absorbs long-wave radi¬ 
ation and remains in the atmosphere for a long time so 
its greenhouse effect is much greater than that of C0 2 . 
The International Panel on Climate Change estimates 
the emission of N 2 0 from agriculture as a percentage of 
the N fertilizer applied. This approach takes no account 
of nitrification and denitrification of non-fertilizer 
sources, mainly from mineralization of organic N. 
Since fertilizers provide about half of the N used for 
food production, the contribution of the organic N to 
N 2 0 emission is probably underestimated. The most 
promising strategy to reduce N 2 0 loss are to minimize 
waterlogging, since denitrification proceeds most rap¬ 
idly in anaerobic conditions. 

Gaseous loss of NH 3 is greatest when ammonium is 
on or near the surface of an alkaline soil, or in the 
floodwater of rice. Windy conditions at the soil surface 
also contribute to rapid loss. In the worst conditions, 
the loss of fertilizer N can be up to 10-15% per day 
[15], but the loss can be reduced or eliminated by 
injecting ammonia-based fertilizers below the soil sur¬ 
face, or, if top dressing is unavoidable, by applying 
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fertilizer before forecast rain, so that the granules are 
dissolved and transported below the soil surface. 

Soil Acidification 

Ammonia-based fertilizers, including urea, acidify the 
soil when protons are released during nitrification. Soil 
pH falls more rapidly in light-textured and poorly 
buffered soils than in heavy-textured and highly buff¬ 
ered soils, but the process continues irrespective of soil 
type. The consequence of acidification is reduced plant 
growth, not because of the protons, but because of 
increasing concentrations of aluminum and manga¬ 
nese in the soil solution that accompany reduced pH. 
The most common cure is to apply lime, which repre¬ 
sents a deferred and indirect cost of N fertilizers and an 
environmental cost since the reaction of lime on acid 
soil releases C0 2 . A less common response is to apply 
nonacidifying fertilizers such as those containing 
nitrate, but these are more expensive than ammonia- 
based products. 

Does N Fertilizer Deplete Soil Organic Matter 
(SOM)? 

Until recently, there was a general belief that N fertilizer 
had little or no effect on SOM but this was challenged 
from analysis of soil data from the Morrow Plots, 
a long-term experiment at the University of Illinois in 
Urbana. Khan et al. [70] analyzed soil C and Mulvaney 
et al. [71] analyzed soil N from these plots and came to 
the startling conclusion that adding N fertilizer 
decreased SOM, apparently because the N fertilizer 
stimulated soil respiration. If true, this conclusion 
would prove that the modern N fertilizer industry is 
unsustainable. 

Reid [72] and Powlson et al. [73] criticized these 
interpretations of the Morrow data. Their criticism was 
that the experiment had been confounded because the 
plot that received a large N dose from 1965 to 2005 had 
previously (1904-1966) received large annual doses of 
manure that had built up SOM to very high levels. After 
the manuring finished on that plot in 1967, the SOM 
decreased simply because it was above the equilibrium 
level for the environment, and not because of 
N fertilizer. This is a strong argument because there is 
ample evidence that soil maintains a high level of SOM 


only if there is a constant input of organic matter to 
offset soil respiration, and this material contains C, N, 
P, and S at ratios close to that of SOM. Kirkby et al. [74] 
surveyed data on SOM content and estimated that the 
average C:N:P:S ratio of humus was 1000:80:20:14. 
After manuring stops, the SOM decreases for many 
decades until it returns to the equilibrium for the envi¬ 
ronment [73]. The decrease in SOM in the Morrow 
plots can be explained more by the oxidation of 
manure than the effect of N fertilizer. The specific 
criticisms by Reid [72] and Powlson [73] about the 
confounded treatments have not been challenged after 
3 years of debate on the subject [75] so it appears that 
the concern about N fertilizer depressing SOM was 
a false alarm. 

Nutrient Transfer from Farmland to Natural 
Terrestrial Environments 

It has long been known that nutrients move from 
farmland to natural environments on land, as well as 
to freshwater and oceans, as described above. Pathways 
for nutrient transfer are water, gas, dust, and wild 
animals [76]. Movement by water is likely to remain 
in the stream system so is relatively unimportant for 
transfer to terrestrial environments. The only nutrient 
moved as gas is ammonia, and the main agricultural 
source of which is housed animals rather than emis¬ 
sions from fertilizer, although the original source of the 
nutrients is mostly fertilizer. 

Nutrient movement in dust has not been widely 
discussed in the agricultural literature but is of more 
interest for earth science. Nutrients move in dust 
when wind lifts clay particles, which normally contain 
a large proportion of organic matter [77]. When the 
wind abates, the particles may land on another agricul¬ 
tural field where the nutrient transfer is relatively 
unimportant, and in natural environment where the 
nutrients may cause environmental damage. 

The other source of nutrient transfer is through 
animals. Where farmland is adjacent to natural vegeta¬ 
tion, it is common to see wild animals grazing on crops 
and pastures during the morning and evening and 
retreating into the cover of bush during the day. Exam¬ 
ples are deer in Europe and North America, kangaroos 
in Australia, and birds almost everywhere. The obvious 
attraction to animals of the margin between farm and 
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bushland is access highly nutritious feed on the farms 
and cover in the natural vegetation. Nutrient transfer 
occurs through urine and dung deposited in the 
bushland. A consequence of nutrient enrichment of 
natural vegetation from all sources is eutrophication 
of the landscape, analogous to eutrophication of water 
[78], with weedy annual plants competing with native 
perennials. Nutrient transfer over short distances could 
place limits on the development of mosaic farming 
systems where farming and natural vegetation are 
distributed through a landscape according to land 
capability. 

Human Health 

Heavy metals can enter the human food chain from 
fertilizer. Cadmium (Cd) is the most toxic and is 
a contaminant in many phosphatic fertilizers [79]. 
When Cd is applied to a field it accumulates first in 
crops, grazing animals, and then in the tissue of people 
who consume the products. The most promising way 
of reducing Cd levels in fertilizer is by manufacturing 
phosphatic fertilizer from sources of rock phosphate 
that contain low levels of Cd [80]. 

Another supposed health hazard from fertilizer was 
nitrate, but the concern turned out to be a false alarm. 
From the 1940s until the 1990s, nitrate was believed to 
cause blue-baby syndrome (methemoglobinemia) 
because some cases of this disease that were associated 
with groundwater containing high nitrate levels, and 
this was the main reason for an official limit of 50 mg/L 
of nitrate in drinking water. Evidence reviewed by 
Addiscott [20] showed that nitrate at this concentra¬ 
tion was harmless and that the original basis for this 
limit was faulty because water containing high nitrate 
concentrations also contained bacterial contamination, 
which was the real culprit for methemoglobinemia. 

There are however beneficial effects of fertilizers on 
human health through micronutrients that are essen¬ 
tial for human health. The best example is Zn, which is 
widely deficient in the human diet. The World Health 
Organization concluded that human Zn deficiency was 
one of the most serious causes of poor health [81] . Low 
levels and low bioavailability of Zn in human food are 
the cause. Increased use of zinc fertilizer would be 
a boon to human health, particularly to people who 
rely on a vegetarian diet, since plant products contain 


less Zn than meat. In view of the difficulty in correcting 
Zn deficiency in crops, supplementation of human 
food with Zn may be more effective [45]. 

Resource Availability 

World food supply relies on increasing amounts of 
nonrenewable resources to provide fertilizers. In the 
case of N, the resource involved is energy, and about 
1% of the world’s supply of fossil fuels is consumed in 
its production. Production of all the other nutrients 
depends on mining as well as relatively small amounts 
of energy used in manufacture and transport. 

The energy cost of producing N fertilizer has 
decreased as technology improved. For example, the 
amount of natural gas needed to synthesize ammonia 
in the most efficient plants halved in the last 40 years of 
the twentieth century, and by 2000 was within about 
25% of the highest possible efficiency [2]. With 
diminishing scope for improved efficiency and exhaus¬ 
tion of natural gas, it is likely that the real cost of 
N fertilizer will rise. 

The nutrient at greatest risk of exhaustion is 
P. Cordell et al. [82] estimated that “peak phosphorus” 
had been reached and that reserves would be exhausted 
by 2030. This conclusion is based on the extrapolation 
of sigmoid curves of production, similar to those used 
to predict the exhaustion of crude oil reserves. Alter¬ 
native approaches reviewed by Cornish [83] comes up 
with estimates of reserves lasting at least until 2100, 
based on current rates of extraction and the amounts 
and quality or known reserves compiled by the US 
Geological Survey. Whether the reserves will last for 
30 or 100 years, there is increasing concern that P will 
be the most limiting nutrient in the long term. The 
most promising ways of prolonging supplies is to 
reduce overuse of P fertilizer and find ways to minimize 
phosphate fixation in soils. If exhaustion of P is as 
imminent as the worst predictions, it will be necessary 
to find ways of recycling at the important points of loss, 
which are in food preparation and in human urine. 
Recycling options are discussed by the Global Phos¬ 
phorus Research Initiative [84]. 

Reserves of high-quality K fertilizers are likely to 
outlast those of P and probably N [83]. One concern 
about K supplies is that they are concentrated in the 
hands of relatively few producers who could be in 
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a position to corner the market. Another concern is 
that the largest reserves of K fertilizer in Canada 
and Russia are remote from the regions of greatest 
K deficiency in Asia. 

Future Directions 

Existing science and technology shows that crops can 
respond reliably and profitably to fertilizers when used 
cautiously. This involves applying fertilizer so that 
the combined supply from fertilizer and the soil is 
sufficient for, but does not exceed, crop requirement. 
This principle has not penetrated far into farming 
systems, and fertilizer management varies from gener¬ 
ally inadequate applications in sub-Saharan Africa to 
excess use of N in East Asia. Many farms in Europe 
and North America have passed the period of excess 
N use. However the efficiency of fertilizer use is still 
generally low and improvement will require incremen¬ 
tal research on farms to refine methods to apply fertil¬ 
izer efficiently and in ways that are compatible with 
farming systems. As farming systems change, as they 
inevitably will, fertilizer management will have to 
change also. The most promising methods are to 
arrange fertilizer rate, time of application, spacing 
and depth of placement to synchronize supply with 
crop demand, and to adjust these parameters in rela¬ 
tion to zones in the field and region. The methods will 
have to be cost effective, so it will be important to 
continue to develop methods to manage low-cost fer¬ 
tilizers appropriately rather than resorting to high-cost 
additives, which may have to be supported by subsidies. 
Science should also critically evaluate and comment on 
“alternative” fertilizer products and systems that make 
outrageous claims and will lead to reduced food secu¬ 
rity if widely used. 

More fundamental science should be directed 
to liberating the large amounts of unavailable soil 
nutrients, while recognizing that release of these 
bound nutrients will only delay the need for addi¬ 
tional fertilizer. The largest environmental improve¬ 
ments related to nutrient overuse are likely to come 
from research to increase fertilizer efficiency on farms. 
The delay between conduct of farm-related research 
and its benefits on a regional scale can be many 
decades [85], so well-resourced field research needs to 
be maintained. 
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Glossary 

Fissile Fissile isotopes are fissionable by the capture of 
neutrons of any energy, but are especially easily 
fissioned by the capture of slow neutrons, for 
example, U 233 , U 235 , Pu 239 , and Pu 241 . 

Fertile Fertile isotopes may be transmuted into fissile 
isotopes by neutron capture. The naturally occur¬ 
ring fertile isotopes are Th 232 and U 238 . 

Critical A critical fission reactor is in a steady state, 
with its neutron population sustained by a chain 
reaction. 

Reactivity Reactivity is a dimensionless parameter, 
which characterizes how far from critical a fission 
reactor is. If zero, the reactor is critical; if positive, 
the reactor is supercritical and its neutron popula¬ 
tion is increasing; if negative, the reactor is 
subcritical. 

Microscopic cross section A microscopic cross sec¬ 
tion is a parameter, with dimensions of area, that 
is a measure of the probability of a particular 
reaction resulting from an incident particle on 
a target nucleus. The macroscopic cross section for 
this “particular” reaction is the microscopic cross 
section times the number density of the target 
nucleus. 


Definition of Subject 

At the end of the nineteenth century and through the 
first half of the twentieth century, revolutionary dis¬ 
coveries were made in physics, and the laws of physics 
and our understanding of them were greatly expanded. 
In addition, tragic historical events led to an unprece¬ 
dented concentration of intellectual talent and eco¬ 
nomic resources (the Manhattan Project) that allowed 
the new physics to be applied to the engineering of 
nuclear (fission) reactors. This entry will describe the 
advances in physics, which are key to fission reactor 
design, and how they enable this engineering practice. 

Introduction 

In 1900, Lord Kelvin (William Thomson) reportedly 
told the British Association for the Advancement of 
Science that “there is nothing new to be discovered in 
physics now. All that remains is more and more precise 
measurements.” Whether he actually said this or not, it 
is reasonable to believe that many scientists and engi¬ 
neers of his day would have concurred. Newton s def¬ 
initions and laws of mechanics and optics had long 
been successfully applied. Maxwell’s equations, Ohm’s 
law, etc. seemed to describe electricity and magnetism. 
Boltzmann and Gibbs had provided the foundations of 
statistical mechanics and thermodynamics. And chem¬ 
ists had been busy developing atomic theory, identify¬ 
ing 92 elements, the laws of chemical combination, the 
weights and sizes of atoms and molecules, and the 
periodic system. 

With hindsight it is clear, however, that in 1900 
there were many intriguing questions outstanding in 
the physical sciences, and there was an historically large 
cohort of scientists, being produced by the major uni¬ 
versities of the day, ready to address them. The ques¬ 
tions (and their resolutions) of prime importance to 
“fission reactor physics” are: 

1. Does a theory of relativity apply to Maxwell’s equa¬ 
tions, and is there a unique frame of reference 
(ether) for the propagation of light? 

2. Why are the heaviest naturally occurring elements 
unstable, giving off various forms of “radiation” 
and transmuting to different elements? 

3. What does the quantization of electromagnetic 
radiation (required to describe black body 
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radiation energy spectra and the photoelectric 
effect) mean to the laws of physics on the atomic 
scale? 

The resolution of each of these questions will be 
discussed in this entry, as they are the starting points 
for the accumulation of knowledge needed to charac¬ 
terize the workings of fission reactors. 

Clearly, Einstein s Theory of Relativity addressing 
question (1), and its identification of mass as a form of 
energy (1905) would, excuse the bad pun, energize the 
whole effort. Already in 1914, H. G. Wells in his novel 
“The World Set Free” envisioned industrial atomic 
energy and atomic bombs used in a catastrophic 
world war. 

At the end of the nineteenth century, electro- 
chemists looking for heavy elements (heavier than 
lead and bismuth) found that “radiation” was given 
off by the materials they were investigating. Becqueral 
(1896) observed y rays (penetrating electromagnetic 
radiation similar to x-rays) from uranium salts. The 
Curies (1898) observed a and (3 rays from polonium 
and radium. Rutherford showed that the positively 
charged as were doubly ionized helium atoms. The (3s 
are negatively charged electrons, the same particles as 
the cathode rays that Thomson characterized and 
named (1897). These “radiations” proved to be key 
tools for determining the structure of atoms. The a 
particle was shown by Rutherford (1911) and his 
coworkers to scatter from gold foil in a manner incon¬ 
sistent with the atomic model of the day, Thomsons 
raisins (electrons) in the pudding (positive charge 
medium) model. To explain the a scattering results, 
an atom’s positive charge and its mass, minus that of its 
electrons, needed to be concentrated in a small nucleus 
(radius ~ 10 -12 cm), with its electrons distributed over 
a much larger volume (radius ~10 -8 cm), that of the 
whole atom. Niels Bohr, inspired by Rutherford’s work, 
took to determining the distribution of atomic elec¬ 
trons. His success, building off Question (3) above, 
led to quantum mechanics. A complete model for the 
atom, however, still required an explanation for the 
mass of the nucleus. Again bombardment of various 
atoms (elements) with a particles led to the answer. 
Chadwick (1932) proved that the “rays” produced by 
as striking beryllium nuclei were neutral particles with 


mass slightly greater than the hydrogen nucleus, the 
proton. These neutral particles are the neutrons that 
had been hypothesized by Rutherford 12 years earlier. 
Heisenberg (1932) produced a detailed model of the 
atomic nucleus where the mass number A is the total 
number of elementary particles, protons plus neutrons, 
making up a nucleus, and the nuclear charge is Z, the 
number of protons. Thus, there can be various isotopes 
for a given element, more than one A for a given Z. 

The discovery of the neutron marked the start of 
furious activity, culminating in the operation of the 
first fission reactor only 10 years later. Leo Szilard in 
1933 recognized that a neutral neutron with modest 
kinetic energy could penetrate an atomic nucleus and 
cause a reaction releasing nuclear (mass) energy, and if, 
as part of the “reaction,” additional neutrons were 
produced, a chain reaction could result. Szilard pro¬ 
duced a patent for a reactor based on this idea and 
assigned it to the British Government in 1936 (before 
fission was discovered). In 1934, Fermi was using neu¬ 
tron bombardment (with neutrons of various energies) 
to produce nuclear transformations in many elements. 
Of special interest was the production of transuranic 
elements, Z greater than 92. Fermi won the 1938 Nobel 
Prize for this work. However, unknown at the time, he 
had also fissioned uranium. This was determined by 
electrochemical analysis of the products of neutron 
bombardment of uranium by Hahn and Strassmann. 
Subsequently, the process was identified as fission by 
Meitner and Frisch. Bohr recognized that the ease with 
which low energy neutrons could cause fission of ura¬ 
nium was due to the existence of the naturally occur¬ 
ring, but low atom percent (0.72%), isotope 92 U 235 [1] 
(Various notations have been used to designate 
a particular isotope, for example, for uranium with 
mass number (A) 235; 92 U 235 , U235, and H 5 U. The 
latter is in common use today. For ease of composition 
and for consistency with most of the references used in 
this entry the older standard, A as a right superscript, is 
used.). He and Wheeler, from their Theory of Fission 
[2], also recognized that the not yet produced isotope 
94 Pu 239 , would also be readily fissioned by slow neu¬ 
trons [3]. This was in early 1939. Bohr still did not 
think production of a fission bomb to be feasible. 

Leo Szilard was, however, not deterred. He per¬ 
suaded his friend Albert Einstein to write President 
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Roosevelt (8/2/1939), urging government support 
of fission research and the stock piling of uranium. 
This ultimately led to the Manhattan Project. In 1940, 
Seaborg and McMillan synthesized the readily fission¬ 
able isotope of plutonium, 94 Pu 239 , which is produced 
by neutron capture in the dominant uranium isotope 
92 U 238 . Wheeler credited Louis Turner [3] with 
pointing out that kilogram quantities of 94 Pu 239 could 
be produced in a large fission chain reaction reactor. 
Fermi and Szilard [4] designed and built the prototype 
for such a reactor, a “pile” of graphite blocks containing 
an array of natural uranium pellets. It was constructed 
in a squash court under a grand stand of the University 
of Chicago’s Stagg Field, and went critical (sustained 
a chain reaction) on December 2,1942. The Manhattan 
Project built large reactors of this type for weapons 
material production, and also successfully pursued 
means of enriching uranium in 92 U 235 . Enriched ura¬ 
nium allows more compact, higher power density, reac¬ 
tor designs. 

The Manhattan Project brought together extraordi¬ 
nary scientific and engineering talent, and immense 
resources to produce the weapons that ended the Sec¬ 
ond World War. It also provided the foundation for all 
fission reactor development that has followed. The 
subsequent advances in “physics,” which have contrib¬ 
uted to this development, are principally: 

1. The full understanding of the interaction of neu¬ 
trons with nuclei: scattering (elastic and inelastic), 
and capture (simple absorption, transmutation, 
and fission), including measuring the parame¬ 
ters that characterize the probabilities of these 
“interactions” 

2. The formulation of methods to solve the neutron 
transport (Boltzmann) equation, which governs the 
behavior of the dilute “gas” of neutrons in a fission 
reactor 

This entry will discuss the topics, pre- and post- 
Manhattan Project, which encompass the physics of 
fission reactors. 

Mass-Energy Relationship 

In his initial paper [5] on the theory of relativity, 
Einstein confronted the problem of guaranteeing that 


the laws of electromagnetism (Maxwell’s equations) 
apply in all inertial reference frames, just as the laws 
of mechanics do. In an inertial reference frame, an 
object, which is at rest, remains at rest and an object 
traveling with a particular velocity will maintain that 
velocity. Einstein asserted that there is no preferred 
reference frame (like stationary ether in space, as pos¬ 
tulated years earlier), and that the speed of light c, in 
vacuum, 2.998 x 10 8 m/s, is the same in all inertial 
reference frames. From these assertions, Einstein 
derived transformations for various variables in the 
laws of physics from one inertial reference frame to 
another. This solved the “electromagnetism” problem 
and provided a firm grounding (theory) for phenom¬ 
ena observed when velocities approach the speed of 
light. For examples of the latter, see Kaplan, “Nuclear 
Physics” on the charge-to-mass ratio of the electron as 
a function velocity, and Mermin, “It’s About Time,” on 
the half-life of unstable particles as a function of their 
velocity. Our interest here is specifically on the rela¬ 
tionship between mass and energy resulting from the 
special (not applying to gravity) theory of relativity. 
What is meant by the ubiquitous formula. 

E = Me 2 ? (1) 

For application to fission, an inelastic collision 
between two particles will be treated for relativistic 
conditions. The approach presented by Mermin in 
“It’s About Time” will be used. 

In an elastic collision, total momentum, P = Pi+ p 2 > 
mass, M = m : + m 2 , and kinetic energy, K = k : + k 2 are 
all conserved, where the mass, m, is an inherent prop¬ 
erty of a particle and is a measure of how it resists 
a change in its velocity. In an inelastic collision, only 
total momentum, P needs to be conserved. It needs to 
be conserved, however, in all inertial frames of refer¬ 
ence. For relativistic conditions, one defines a particle’s 
momentum (a vector [in bold face]) as 

p = mu/(l — u 2 /c 2 ) 1/2 , (2) 

where u is the particle velocity. As is required for 
consistency between relativistic and nonrelativistic 
laws of mechanics, Eq. 2 is effectively the nonrelativ¬ 
istic definition of momentum for the particle speed, 
u << c. Now to find p', the particle momentum, in 
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a frame moving with velocity v relative to the frame in 
which the particle has velocity u, one applies the rela¬ 
tivistic translation law for velocities: 

U' = (u-v)/(l-uv/c 2 ). (3) 

Substituting for u' in the expression for p' (Eq. 2 
with p and u primed), one obtains the relativistic 


translation law for momentum 


p' = (p-p°v)/(l-v 2 /c 2 ) 1/2 , 

(4) 

where 


p° = m/(l - u 2 /c 2 ) 1/2 . 

(5) 


Now, if total momentum is to be conserved in our 
two-particle inelastic collision in both the primed and 
unprimed frames, then P° = p°i + p° 2 must also be 
conserved. Again, using the relativistic translation law 
for velocities (Eq. 3) and the definition p° (Eq. 5), we 
find that 

/ = (P° - pv/c 2 )/(l - v 2 /c 2 ) 1/2 . (6) 

And so for the total quantities we want to be con¬ 
served we have 

P'= (p - p°v)/(l - v 2 /c 2 ) 1/2 and (7) 

P 0 ' = (p° - Pv/c 2 )/(l - v 2 /c 2 ) 1/2 . (8) 

Examining these expressions, it is clear that if P and 
P° are not changed after an inelastic (or elastic) colli¬ 
sion, then neither is P' and P° . 

In the limit of the speed u being much smaller than 
c, the difference between p° and m, (p° - m), 
approaches mu 2 /2c 2 . This result leads to a definition 
of relativistic kinetic energy, k, for a particle 

k = p°c 2 - me 2 , (9) 

which has the required property of reducing to the 
nonrelativistic form, mu 2 /2, in the limit of u much 
smaller than c. 

Returning to our two-particle inelastic collision, as 
P is conserved so is P°c 2 and thus from Eq. 9 

AMc 2 = AK, (10) 

where AM is the change in the masses of the inputs and 
outputs of the collision participants, and AK is the 


change in the kinetic energies of these “inputs and 
outputs.” Thus, Eq. 10 provides insight into the mean¬ 
ing of “E = Me 2 ” for the fission process. For n + 92 U 235 
i—> fission products + 202.7 MeV (the AK of Eq. 10 in 
unit of millions of electron volts) the percent change in 
mass can be estimated by dividing 202.7 MeV by the 
energy equivalents of the inputs (i.e., 236 amu, where 
1 amu = 931.141 MeV). The result is ~0.1%, which 
may not appear to be large until one makes a compar¬ 
ison with a chemical reaction. For example, 0 2 + C —> 
C0 2 + 4.1 eV. A similar calculation indicates a 1 x 
10 _8 % conversion of mass to kinetic energy. Since one 
could not measure such a small change in total input 
and output masses in chemical reactants, it is not 
surprising that the full impact of “E = Me 2 ” had to 
await demonstration in a nuclear reaction like fission. 
However, as will be discussed in the next three sections, 
the large energy release in fission, while conforming to 
Eq. 10, is due to the strength of the forces that hold 
a nucleus together and the charge repulsion forces that 
will accelerate two smaller nuclei as they are formed in 
the fissioning of a larger “parent” nucleus. 

Heavy Elements 

The “heavy elements” of particular importance to 
fission reactors are the radioactive nuclei, which are 
characterized by systematic chains of decay. In nature, 
there are three chains (series). In a given series, each 
nucleus has a mass number ; A, governed by a simple 
formula with the variable the integer n (see Table 1), 
and is identified with its longest lived isotope, that 
is, Thorium, Uranium, and Actinium (U 235 had not 
been discovered when the 4n + 3 series was identified). 
These longest half-lives are not surprisingly comparable 
to the age of the earth, 4.5 x 10 9 years. Half-life is one 
of three related parameters of radioactive decay pro¬ 
cesses, T 1/2 , X, and t. The fundamental equation of 
radioactive decay is 

-dN(t)/dt = MNT(t), (11) 

where X is the decay constant , and N(t) is the number of 
decaying nuclei at time t. The solution of Eq. 11 is 

N(t) = N(0)e“^. (12) 

The time when an original inventory of decaying 
nuclei, N(0), is halved is 
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T 1/2 = ln2/X = 0.693/F (13) 

And as the decay process is statistical the mean life¬ 
time , x, of a decaying nucleus is 

oo 

t = (1/N(0)) J N(0)A,te _Xt dt**l/X,, (14) 

0 

the reciprocal of the decay constant. 

With the search for transuranic elements through 
the bombardment of the heaviest natural elements, 
primarily with neutrons, a fourth decay series was 
identified, the Neptunium (A = 4n + 1) series whose 
radioactive members are not found in nature (see 
Table 1). 

Of the “heavy elements,” the isotope U 235 is key to 
fission reactor design. It is the only naturally occurring 
isotope which readily fissions when bombarded with 
neutrons of all energies. While its atomic percent abun¬ 
dance, 0.72%, is small, it is large enough to support 
chain reactions in reactors where neutrons born in 
fission are slowed down (moderated) by graphite 
(carbon) or by heavy water (deuterium oxide). When 
Uranium is enriched in U 235 (^3-5%), it can fuel 
reactor designs where ordinary water moderates fission 
neutrons (today’s pressurized water and boiling water 
reactors). Having U 235 available as a reactor fuel makes 
it possible to exploit the two abundant fertile “heavy 
elements,” U 238 and Th 232 . The term “fertile” refers to 
the fact that when these elements absorb a neutron they 
can be transmuted to fissile isotopes (Pu 239 and U 233 
respectively), which like U 235 readily fission when 
bombarded by neutrons of all energies. The transmu¬ 
tation processes are shown in Fig. 1. It is important to 
note that only one neutron capture is required in each 
of these transmutations. In a reactor design, neutron 


economy is the key to maintain a chain reaction and, as 
will be discussed in the section on Future Directions, 
expending one neutron with a reasonable probability of 
obtaining an additional fissile nucleus is a winner. 

The heavy element radioactive decay series are also 
important to safety in fission reactor design. Each of 
the decay processes, a and (3“ emissions and associated 
ys, is favorable to energy release. So any heavy elements, 
particularly transuranics, in a reactor’s fuel system will 
contribute to the decay heat load that must be dissi¬ 
pated when a reactor shuts down. As will be discussed 
in the next section, the major short-term contributors 
to decay heat are fission products. A power reactor that 
shuts down following a sustained run at full rating will 
initially produce ~7% of that rating from decay heat, 
even if the chain reaction and nearly all fissioning has 
ceased. 

For a full discussion of the radioactive decay series 
and the particulars of a, /F, (3 + and y emission, see 
Kaplan, and Krane, “Introductory Nuclear Physics.” 

Fission and Its Products 

As noted in the “Introduction,” fission was discovered 
accidentally during the search for transuranic elements. 
This work by Fermi and others was part of an extensive 
effort to understand the atomic nucleus and to dupli¬ 
cate the great success of quantum mechanics and 
the Pauli exclusion principle in providing a fully pre¬ 
dictive Theory of atomic electron structure. A compa¬ 
rable theory for the nucleus has not been developed, 
but several models (e.g., shell and liquid drop) provide 
insight into the trends and correlations found in 
the data provided by the extensive experimentation 
performed on the nuclei of the various elements and 
their isotopes. 



Fission Reactor Physics. Table 1 Heavy element decay series 


Series name 

Type 

Final stable nucleus 

Longest lived nucleus 

Longest half-life (years) 

Thorium 

4n a 

pb 2os 

Th 232 

1.41 x 10 10 

Uranium 

4n + 2 

pb 2os 

u 238 

4.47 x 10 9 

Actinium 

4n + 3 

pb 207 

u 235 

7.04 x 10 s 

Neptunium, not in nature 

4n + 1 

Bi 209 

Np 237 

2.14 x 10 6 


a n is an integer 
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Measurements of atomic mass (m(X A )), and the 
mass of the electron, proton, and neutron, yields the 
binding energy , B, of a nucleus, Z X A , the work (energy) 
required to disassemble a nucleus into its neutrons and 
protons: 

B = {Zm p + Nm n - (m(X A ) - Zm e )}c 2 , (15) 

where Z is the atomic number (the number of protons) 
and N = A - Z is the number on neutrons. (The binding 

23 min 

l/ 238 + n 4^239 _/\/p239 + p- + y 

\ 2.3dPu 239 + p~+ V 

22 min 

TT ? 232 + n -> Th 233 Pa 233 + p~ + v 

\ 27c/—> L/ 233 + p~ + v 

Fission Reactor Physics. Figure 1 

Transmutation of fertile to fissile nuclei, (v is the 
antineutrino, the chargeless, ~zero mass particle that 
accompanies ft emission) 


energy of atomic electrons is ignored as negligible com¬ 
pared to the other factors in Eq. 15.) 

Plotting the ratio of measured binding energies B to 
corresponding mass number A (Fig. 2) immediately 
makes evident the potential of energy release from 
fission of heavy element. Note the B/A versus 
A “curve” has a flat maximum in the middling A 

range ~50 -> 150, and falls off (decreases) as 

A increases. Thus, there is a potential energy excess if 
a heavy element (isotope) can be disassembled and 
reassembled as two mid-range isotopes (preserving 
total A, Z, and N). (The behavior of the B/A curve for 
light elements shows the potential energy release from 
fusion.) Obviously fission (nor fusion) does not take 
occur “naturally” on earth today (There is convincing 
evidence that a naturally occurring chain reaction 
took place in a uranium deposit in Gabon about 
2 x 10 9 years ago, when the abundance of U 235 would 
have been ~3%, high enough for a water-moderated 
“reactor” to operate. The higher earlier abundance is 
due to the shorter half-life of U 235 (7.0 x 10 8 y) relative 
to that of U 238 (4.4 x 10 9 y). See Krane for an excellent 



Mass number A 


Fission Reactor Physics. Figure 2 

Binding energy per nucleon (Krane) 
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discussion of the Gabon reactor). The remainder of this 
section is devoted to particular requirements for fission 
to take place and to the discussion of the resulting 
fission products and their energies. 

Insights provided by examining binding energies, and 
by additional experiments to determine nucleon-nucleon 
forces have led to the Shell and Liquid Drop models of the 
nucleus. Features of these models are incorporated in the 
semiempirical mass formula (Eq. 16). While a thorough 
discussion of the nuclear models is beyond the scope of 
this entry (see Kaplan or Krane), the mass formula 
provides key information on fission, energy release, 
and the relative likelihood for various nuclei. 

In the semiempirical mass formula, the binding 
energy has five terms, which will be discussed below. 

m( z X A ) = Zm p + Nm n 

— [Bo + Bi + B 2 + B 3 + B4] / c 2 . 

(16) 

B 0 = a v A is the volume energy. Note in Fig. 2 that B/A 
saturates, thus B 0 has a linear dependence on A. The 
attractive nuclear forces between nucleons (n-n, n-p 
and p-p) are all equal and short range, smaller than the 
radius of the nucleus, r = r 0 A 1/3 where r 0 ~1.2 x 
10 “ 12 cm. If the range were larger, there would be 
attraction between each nucleon pair and B 0 would 
depend on A(A - 1). 

Bi = -a s A 273 , is the negative surface decrement. As 
the nucleon-nucleon forces are “short range,” neutrons 
and protons on the surface of a nucleus are less tightly 
bound. 

B 2 = -a c Z(Z— 1 )/A 1/3 , is the coulomb repulsion 
decrement. While the nuclear forces are strong enough 
to overcome coulomb forces, the protons in the nucleus 
do repel and reduce binding energy. Assuming 
a uniform distribution of protons in a liquid drop 
model of a spherical nucleus, an electrostatics calcula¬ 
tion yields the dependence of B 2 the number of proton 
pair, Z(Z— 1 ), and a measure of their spacing, A 1/3 . 

B 3 = -a a IN-ZIIN-ZI/A, is the neutron-proton pop¬ 
ulation asymmetry decrement. As nuclei become 
heavier, more neutrons than protons are needed to 
overcome coulomb repulsion. However, as the shell 
model of the nucleus demonstrates when nucleons, 
neutrons and/or protons, are added to form heavier 
elements and their isotopes, they fill shells of 


successively higher energy and are thus less tightly 
bound. This is analogous to the case of atomic elec¬ 
trons. Neutrons and protons have half-integral spin like 
the electrons, and therefore no two neutrons (or pro¬ 
tons) can occupy the same state in a nucleus in confor¬ 
mance with the Pauli exclusion principle. So B 3 is 
negative and proportional to the neutron excess and 
the fraction of the nucleus the excess represents. 

B 4 = +5A -3/4 for even Z even N nuclei, = 0 for odd 
A nuclei, = -5A _3/4 for odd Z odd N nuclei, is the 
pairing energy. As nucleons are added and fill shells, 
they are more tightly bound as spin up and spin down 
pairs. B 4 is important in determining the relative bind¬ 
ing of isotopes of a given element and their propensity 
to fission. 

A set of parameters for B which best fit the 
B/A curve (Fig. 2) is provided by Krane; a v = 15.5 MeV, 
a s = 16.8 MeV, a c = 0.72 MeV, a a = 23 MeV, and 
5 = 34 MeV. 

The potential for, and magnitude of, energy release 
from fission, whether as spontaneous decay or induced 
by particle or gamma ray capture, can be assessed with 
the semiempirical mass formula. As for an estimate of 
the magnitude of energy release, the B/A curve, as 
noted earlier, can be used directly. For example, the 
B/A for U 238 is ~7.6 MeV. If it fissioned into two 
approximately equal mass nuclei (A = 119), their B/A 
would be ~8.5 MeV when in a ground state, and being 
more tightly bound than their parent (U 238 ) 214 MeV 
(= 2 x 119 x 8.5 — 238 x 7.6) will be available through 
conservation of energy as kinetic energy of the daugh¬ 
ter nuclei and of other fission products (neutrons, (3s, 
ys, and neutrinos). That this energy is available does 
not mean that there is a significant probability that 
fission occurs. In this example, which represents spon¬ 
taneous fission of U 238 , one finds in nature that this 
mode of U 238 decay competes poorly with a decay 
(Spontaneous fission is a significant mode of decay 
for some transuranic isotopes found in depleted reac¬ 
tor fuel, particularly Pu 240 and Pu 241 .). For fission 
fragments, daughter nuclei, to separate in spontaneous 
or induced fission, a potential barrier must be over¬ 
come. The height of the barrier relative to the ground 
state of a fission parent nucleus is called the fission 
activation energy (E a ). It can be estimated with the 
liquid drop model by calculating the change in the 
parent nucleus binding energy (B x and B 2 ) between 
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Fission Reactor Physics. Table 2 Heavy nuclei fission 


Target 

nucleus 

Compound 

nucleus 

E e , Excitation 
energy (MeV) 

E a , Activation 
energy (MeV) 

u 233 

[U 234 ] 

6.6 

4.6 

u 235 

[U 236 ] 

6.4 

5.3 

Pu 239 

[Pu 240 ] 

6.4 

4.0 

u 238 

[U 239 ] 

4.9 

5.5 

Th 232 

[Th 233 ] 

5.1 

6.5 


the ground-state spherical configuration and a volume- 
conserving dumbbell configuration (ref. [2] and [6]). 
Table 2 contains values of E a for the compound nuclei 
formed by neutron capture in the fissile and fertile 
isotopes of primary interest in reactor design. These 
are compared with the excitation energy (E e ) provided 
in forming the compound nucleus. 

E e = [(m( z X A ) +m n ) - m( z X A+1 )]c 2 . (17) 

Note that E e does not include any kinetic energy 
contribution from the captured neutron. For the 
fertile target nuclei (U 238 and Th 232 ), E e < E a and 
neutron kinetic energy will be required to overcome 
or quantum mechanically penetrate (with high proba¬ 
bility) the potential barrier to fission. For the fissile 
targets, E e > E a and thus “slow” neutrons can initiate 
fission. 

The high values of E e for the fissile targets are due to 
the positive “pairing” contribution, B 4 , to the binding 
energy of the compound nucleus ground states. Note 
92 U 234 , 92 U 236 , and 94 Pu 240 are all even Z even N nuclei 
and the corresponding target nuclei are even Z odd N. 
So, the second term in Eq. 17 is decreased by 8(A + 
1 ) _3/4 , and B 4 is zero in the first term. Thus, an increase 
in E e relative to the result if pairing is ignored is 
achieved. For fertile targets (even Z even N), roles are 
reversed. It is the first term in Eq. 17 that is decreased 
and B 4 is zero in the last term. Thus, E e is lower than if 
pairing is ignored. 

The semiempirical mass formula and the shell and 
liquid drop models are limited in predicting the fission 
process. This is best illustrated by the mass distribution 
of the major fission fragments (see Fig. 3). In the vast 
majority of cases, fission yields two unstable (having 


excess neutrons) nuclei, but not of equal mass, as in the 
example above used to estimate the energy available 
from spontaneous fission of U 238 . The two humped 
curves in Fig. 3 are not predicted by nuclear models. 
To quote Krane, “surprisingly, a convincing explana¬ 
tion for this mass distribution has not been found.” 

From the nuclear models, it is not surprising that 
free (prompt) neutrons are emitted in fission as the 
daughter nuclei are so rich in neutrons, but the predic¬ 
tion of their number (~2.5 on average) and energy 
spectrum (the mean ~2 MeV, see Fig. 4) are still an 
active area of study. The decay chains of the neutron- 
rich, excited daughter nuclei (fission fragments) are 
well predicted, including the release of (delayed) neu¬ 
trons when in some cases neutron decay competes 
successfully with (3-decay. The delayed neutrons are 
a small fraction of the total neutron emission (0.64% 
for thermal fission of U 235 ), but as will be discussed in 
section “Fission Reactor Performance”, they are impor¬ 
tant to reactor control. 

Total energy release from the various neutron- 
induced fissions of interest in reactor design is remark¬ 
ably consistent with the simple spontaneous U 238 
fission calculation made above. Of course, the constit¬ 
uents are different, as displayed in Table 3. 

In a reactor design, the total energy values in Table 3 
are not used. First, the contribution from neutrinos is 
subtracted, as their range before collision is well beyond 
reactor boundaries. Then, the energy release per fission 
from neutron captures which produce (3s and ys is 
added. The magnitude of this release is design- 
dependent as it is a function of the materials used, 
and the neutron capture rate in these materials. For 
plant energy balance studies, using 200 MeV/fission is 
satisfactory. 

The problem of decay heat was noted in the previ¬ 
ous section. From Table 3, it can be seen that fission 
product decay is the immediate concern when a chain 
reaction is terminated. Assume full power from U 235 
fissioning, when this ceases, delayed ys and (3s are still 
being released. Thus, ~6.3% (~ 100 x (6.26 + 6.43)/ 
200 ) of rated power, coming from fission product 
decay, must still be dissipated, along with energy from 
the decay of transuranic elements present in the reactor, 
for a total of ~7%. 

Over 800 fission fragment nuclei have been identi¬ 
fied. Their decay must be tracked to account for decay 
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Fission Reactor Physics. Figure 3 

Fission yields: (a) for U 235 from fast and thermal neutrons, (b) for U 233 and PU 239 from thermal neutrons [33] 


heat in reactor shut-down safety analysis and for 
the proper handling and storage of spent fuel (where 
both energy release and the nature of radiation fields 
must be known). One hundred and two of these 
nuclei are delayed neutron precursors. To simplify reac¬ 
tor transient (kinetics) calculations, the precursors 
are collected into six effective groups, where members 
of a given group have similar decay constants (see 
Table 4). 

The energy spectra for a given delayed group do not 
vary significantly with fissioning isotope. The spectra are 
much softer (with lower mean energies, < 1 MeV) than 
for prompt neutrons [7]. This means that a delayed 
neutron in a thermal reactor is more important than 
a prompt neutron. It is more likely to reach the low 
energies (<0.625 eV) where most fission occurs. 
Delayed neutrons can also result from other reactions, 
for example, photon capture (y,n) (The expression 
(a,b) is shorthand for a nuclear reaction with an input 
particle “a” and output particle “b”, where the target 
and product nuclei are understood.) and neutron acti¬ 
vation (n,p) followed by neutron decay of the product 


nucleus. If important to a particular reactor design, 
these delayed neutrons can be included by modifying 
the effective delayed group structure. 

The final aspect of the ~800 fission fragment nuclei 
that must be dealt with is their impact on neutron 
balance. Each of them has a probability of capturing 
neutrons and in some case of causing a transmutation 
into a nucleus with a particularly large propensity for 
capturing neutrons. The nuclei of greatest importance 
to neutron balance are listed in Table 5. 

I 135 is important as it is the direct precursor of 
Xe 135 , an especially large absorber of thermal neutrons. 
The next two isotopes in the table are precursors to 
a decay chain with three large absorbers, Pm 147 , Sm 149 , 
and Sm 151 . The final five, with their precursors in 
parentheses, are large absorbers, but not as sensitive 
to neutron energy spectrum and power level and his¬ 
tory as the others. Clearly, data for the 800 fission 
fragments must be handled through large computer 
files [8]. For neutron balance, the fission fragment iso¬ 
topes, which are not treated explicitly (Table 5), can be 
lumped into an effective fission product nucleus with 
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U-235 ENDF / B MATERIAL NO. 1395 
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Fission Reactor Physics. Figure 4 

Prompt neutron energy spectra where P f (E') is the probability per unit energy [32] 


a yield per fission and probability for neutron capture. 
How one characterizes the probability of nuclear reac¬ 
tions is the subject of the next section. 

Cross Sections 

The nuclear reactions of importance to fission reactor 
design are by definition governed by the postulates of 
quantum mechanics (i.e., they are on the dimensional 
scale of the nucleus). And, thus the results of the var¬ 
ious reactions are probabilistic in nature. The proba¬ 
bility of a particular result is characterized by a 
parameter, the microscopic cross section, a, with, not 
surprisingly, the dimensions of area, and which is 
quoted in units of barns. The barn, 10 -24 cm 2 , is a rea¬ 
sonable measure as in some cases a is nearly the 


projected area of a target nucleus, 4 tiR 2 , and R is 
~10 -12 cm. Thus, envisioning a target foil of area, A, 
and thickness, dx (where dx is small enough to have 
negligible shadowing of one nucleus by another in the 
target foil), the probability that an incident particle in 
traveling a short distance (i.e., dx) will undergo 
a specific reaction equals apAdx/A, where p is the 
density of target nuclei (#/cm 3 ) in the foil. It follows 
that to find the reaction rate in the foil we need the 
number of impinging particles per second. Given 
the particles have a density N (#/cm 3 ) and are 
monoenergetic and monodirectional (normal to the 
face of the foil) with speed v, the number impinging 
per second equals N *(vdt)* A/dt. So the total reaction 
rate in the foil is (NvA)(padx), and the rate per cm 3 is 
vNpa. The parameters that make up this specific rate 
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Fission Reactor Physics. Table 3 Energy yield in MeV from fission. Thermal neutron fission of fissile isotopes; fertile 
isotope fission is from neutrons with energy spectrum of a light water-moderated reactor [7, 32] 



Fissile or fertile isotope 1 





Energy 

232 Th 

238u 

235 u 

233u 

239 Pu 

241 Pu 

Fission fragment 
kinetic energy 

162.1 ± 1.5 

170.0 zb 0.66 

169.6 zb 0.7 

168.7 zb 0.7 

175.9 zb 0.1 

175.5 zb 1.1 

Prompt neutron 
kinetic energy 

4.7 ± 0.12 

5.51 zb 0.1 

4.79 zb 0.07 

4.9 zb 0.1 

5.9 zb 0.1 

5.99 zb 0.13 

Delayed neutron 
kinetic energy 

0.024 zb 0.004 

0.021 zb 0.003 

0.0071 zb 0.0007 

0.0014 zb 0.0005 

0.001 zb 0.0005 

0.0074 zb 0.0015 

Prompt y rays 

6.15 zb 1.75 

6.28 zb 0.8 

6.96 zb 0.7 

7.59 zb 0.71 

7.74 zb 0.45 

7.86 zb 1.8 

Delayed y rays 

8.01 zb 0.2 

8.04 zb 0.08 

6.26 zb 0.05 

4.99 zb 0.04 

5.16 zb 0.1 

6.33 zb 0.07 

(3 particles 

8.28 zb 0.21 

8.25 zb 0.08 

6.43 zb 0.05 

5.13 zb 0.04 

5.3 zb 0.1 

6.51 zb 0.04 

Neutrino energy 

11.1 zb 0.3 

1 1.14 zb 0.11 

8.68 zb 0.06 

6.91 zb 0.05 

7.15 zb 0.11 

8.78 zb 0.09 

Total energy 
yield 

200.3 zb 0.5 

209.3 zb 0.3 

202.7 zb 0.1 

198.2 zb 0.1 

207.2 zb 0.3 

211.0 ±0.3 



Fission Reactor Physics. Table 4 Prompt and delayed neutron data. v p is the prompt neutron yield versus initiating 
neutron energy (ENDF/B-VII.O). v d is the total delayed yield, constant for fission initiating neutron energies between 0 and 
4 MeV, and above 7 MeV. Between 4 and 7 MeV, v d tracks linearly [32] 


The relative abundance 71 = 

I’i/Vd 







Neutron E 

233 Ui 

; p 

235 U v p 

239 Pu Vp 

241 Pu Vp 

0.0253 eV 

2.4894 

2.4208 

2.8724 

2.9251 

2 MeV 

2.5758 

2.6366 

3.171 

3.1727 


* 

E(MeV) 


233 u 

235 u 

239 Pu 

241 Pu 

0-4 

0.0074 

0.0167 

0.00645 

0.0162 

7 

0.0047 

0.0090 

0.0043 

0.0084 

233 U 

235u 

239 Pu 

241 Pu 

Delayed 

Group 

Decay Constant 
\j, s 1 

Relative 
Abundance y t 

Xi, s 1 

Yi 

\j, s 1 

Yi 

Xj, s 1 

Yi 

1 

0.01258 

0.086 

0.01272 

0.038 

0.0129 

0.038 

0.0128 

0.010 

2 

0.03342 

0.274 

0.03174 

0.213 

0.0311 

0.280 

0.0299 

0.229 

3 

0.131 

0.227 

0.116 

0.188 

0.134 

0.216 

0.124 

0.173 

4 

0.303 

0.317 

0.311 

0.407 

0.332 

0.328 

0.352 

0.390 

5 

1.27 

0.073 

1.4 

0.128 

1.26 

0.103 

1.61 

0.182 

6 

3.14 

0.023 

3.87 

0.026 

3.21 

0.035 

3.47 

0.016 
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Fission Reactor Physics. Table 5 Direct yield fractions ( xlOO) for isotopes in the most important fission product 
chains [32] 



Fissile or fertile is 

otope a 




Fission Product 

232 Th 

238u 

235u 

233 u 

239u 

135, 

5.238 

6.548 

6.349 

4.860 

6.303 

135 Xe 

0.0403 

0.0150 

0.255 

1.337 

1.152 

147 Nd 

3.08 

2.711 

2.271 

1.775 

2.073 

149 Pm 

0.825 

1.765 

1.089 

0.769 

1.261 

"Mo ( 99 Tc) 

2.965 

6.247 

6.127 

4.957 

6.144 

103 Ru ( 103 Rh) 

0.164 

6.336 

3.137 

1.707 

6.991 

i3i|(i3i X e) 

1.481 

2.982 

2.473 

2.352 

3.093 

133 l ( 133 Cs) 

3.858 

6.356 

6.787 

5.974 

6.923 

143 Ce ( 143 Nd) 

6.619 

4.834 

5.972 

5.881 

4.561 


a The energy of the neutron initiating the fission is in the thermal range for 235 U, 233 U, and 239 Pu. For 232 Th and 238 U, the yields are due to 
fissions initiated by neutrons with a spectrum of energies typical of light water-moderated nuclear reactors. 


have been reordered to reflect conventional definitions 
in reactor physics (In the nuclear engineering disci¬ 
pline, reactor physics refers to the portion of the field 
addressed in this entry): 

vN = 'Fparticle flux , and (18) 

po = Z macroscopic cross section. (19) 

Th e flux in our simple foil example is the number of 
particles per cm 2 per second crossing a plan parallel to 
the face of the foil. Given the more general representa¬ 
tion of particle density (which will be used in the next 
section): 

N(r,E,12,t)dr 3 dEdQ = no. of particles in dr 3 about 
r, with kinetic energies in dE about E, and going in the 
solid angle dQ about the unit direction vector 12 (see 
Fig. 5), at time t; then the corresponding definition of 
flux , x F(r,E,fl,t)dsdEdQ, is the no. of particles with E in 
dE going in direction Q in dQ that pass through the 
surface ds, which is located at r and is normal to 12, per 
unit time, at time t. 

The macroscopic cross section is the probability that 
a particle undergoes a reaction characterized by a, 
per unit path (for small paths, dx) traversed by the 
particle in a homogenous material with target nucleus 
density, p. This definition lends a special significance to 
1 /Z T , where Z T = pa T . a T is the total microscopic cross 



Fission Reactor Physics. Figure 5 

The unit direction vector Q associated with neutron 
velocity and the differential (small) solid angle dQ which 
defines the range of directions 

section, the sum of the as for all of the reactions that 
the initiating particle can undergo with a given target 
isotope. So the change in flux, where 12 is parallel to the 
x axis of a target material sample, over a small 
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interval dx in the sample is d'F = - v PX T dx. And thus, 
'F(x) = 'F(O) e _2rX , where x is the distance into the 
“sample” (which has its face in the y - z plane at x = 0 ). 
So, the probability of a reaction in dx about x can be 
expressed as 

P(x)dx = Zxdx» ('f / (x)/ x F(0)) = 'Lje~ IiTX dx. 

( 20 ) 

And thus the mean free path of a particle in an 
incident beam ('F(O)) before being removed from the 
beam in a homogeneous target is 

(X) 

v = j dxxP(x) = 1 /Zt- (21) 

0 

or generally the mean free path is the average distance 
traveled between successive interactions. 

If a homogeneous material is made up of various 
nuclei (elements/isotopes, indexed by j), then the mac¬ 
roscopic cross section for a reaction, i, is 

( 22 ) 

j 

The reactions of primary interest in fission reactor 
design are those initiated by neutrons and gammas. 
Neutron cross sections are key to determining if 
a chain reaction can be maintained, and that the neu¬ 
tron population can be controlled under various tran¬ 
sient conditions (e.g., start-ups and shutdowns, 
planned and accidental), and, of course, the fission 
distribution in the reactor. Most of the resulting energy 
release, from fission fragments, is deposited locally in 
the fuel elements of a given design. However, gammas, 
from fission and neutron capture in reactor structures, 
have large mean free paths, and their distribution and 
capture rates must be determined, using gamma cross 
sections, to complete the knowledge of energy deposi¬ 
tion. The subsequent engineering problem is to assure 
that the reactor cooling system can remove the depos¬ 
ited energy under normal and accident conditions. 
Neutron and gamma cross sections are also required 
for the shield design of a fission reactor. 

Neutron reactions are characterized by their energy 
balance, the Q factor, as well as microscopic cross 
sections. For the simple reaction (with the target at rest), 

n + X —> Y + y (23) 


the energy balance is 

(E n + m n c 2 ) + M x c 2 = (Ey + MyC 2 ) + (Ey + myC 2 ), 

(24) 

and Q is defined as the difference in the kinetic energies 
of the inputs (here the neutron) and the outputs: 

Q = E y + E y - E n or = (M x + m n - My - m y )c 2 . 

(25) 

If Q is positive, the reaction is exothermic, if nega¬ 
tive, endothermic. For an endothermic reaction to go, 
for the microscopic cross section to be nonzero, 
enough kinetic energy must be supplied by the neutron 
to excite a compound nucleus, X A+1 , so it will decay to 
Y + y. As momentum must be conserved, 

m n v n = (M x + m n )V c or V c = v n m n /(M x + m n ), 

(26) 

where V c is the velocity of the compound nucleus. 
Then, the neutron energy supplied must be such that 

- Q = m n v n 2 /2 - (M x + m n )V c 2 /2 (27) 

and the threshold energy , E th , for the reaction is 

E t h = m n v n 2 /2 = ( - Q)(l + m n /Mx). (28) 

(n, 2 n) is an example of an endothermic reaction whose 
cross sections will exhibit an energy dependence of zero 
until the neutron energy E reaches an E th . 

The simplest, but very important, neutron reaction 
to be considered is a form of elastic scattering (Q = 0), 
where collisions can be treated with classical mechanics 
as hard sphere, billiard ball, interactions. For the ener¬ 
gies of neutrons in fission reactors, 0-10 MeV, elastic 
scattering cross sections for most nuclei are constant 
and proportional to the square of the nuclear radius, 
~A2/3. Assuming the target nucleus to be at rest and 
applying conservation of energy and momentum in the 
center of mass, CM, coordinate system, one determines 
the probability that the final energy of the scattered 
neutron, in the laboratory coordinate, LM, system, is E f 
in dE f : 

P(Ei E f ) = 1/(1 - a)Ei, for aE { < E f < E* 

= 0 otherwise, 

where a = ((A—1)/(A+1))2 and Ei is the initial neutron 
energy. A is the mass number of the target nucleus. 
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And, scattering is assumed to be isotropic in the CM 
coordinate system. This is a good assumption for the 
energy range of interest here, and its basis will be 
discussed later in this section. A full derivation of 
Eq. 29 can be found in Duderstadt and Hamilton, 
“Nuclear Reactor Analysis.” Examining P(Ei=>E f ) one 
sees that a neutron scattering off a hydrogen nucleus 
(A = 1) can lose all its energy (as a = 0). On average, it 
loses half its initial energy as 

Ei 

E f = / dE f E f P(Ei —> E f ) = Ei(l + a)/2,and (30) 

ocEi 

AE = Ei -► E f = Ei(l - a)/2. (31) 

Given P(Ei —> Ef) as in Eq. 29, one defines differen¬ 
tial microscopic elastic scattering cross sections, 
a e s j (Ei)P(Ei^Ef)dE f , which are particularly useful in 
determining how neutrons, born in fission, are slowed 
down in reactors designed to take advantage of the 
large fission cross sections of fissile isotopes in what is 
conventionally defined as the thermal neutron energy 
range, less than 0.625 eV. The superscript “j” of <jJ 
refers to the nuclei of the various moderators (hydro¬ 
gen, deuterium and carbon) that are employed in these 
thermal reactors. 

Once neutrons have slowed to the thermal range the 
target nuclei can no longer be assumed to be at rest. The 
interaction frequency will then be 

|v — V| cr(|v — V|)p, (32) 

where |v- V| is the relative speed of neutron and target. 
For elastic scattering, a(|v - V|) is still nearly constant 
and an average cross section for thermal neutrons with 
speed v (=(2E/m n )l/2) is 

ff(v) = (o es /\p) f d 3 V|v- V|p(V), (33) 

where for many reactor applications the Maxwell- 
Boltzmann velocity distribution for ideal gases in ther¬ 
mal equilibrium at absolute temperature, T, can be 
used to represent the targets. Thus, 

p(V) = p • (M/(27ikT)) 3/2 exp(—MV 2 /2kT), (34) 

where M is the mass of the target nucleus and k is the 
Boltzmann constant (8.6174 x 10 -5 eV/K, K is degrees 
Kelvin). 


From Eq. 33, one sees that for v ^ V the average 
cross section is, as expected, a es . And, as the neutron 
speed decreases and approaches zero, the average cross 
section goes as one over the neutron speed. 

For highly accurate calculations (As part of the 
process of evaluating nuclear data sets, very accurate 
calculations of integral experiments are made. Zero 
power mockups of reactors, with carefully recorded 
dimensions and inventories, are commonly used. 
Monte Carlo calculations (to be discussed in the next 
section) of neutron balance in the mockups are made 
with various data sets (e.g., cross-section libraries) to 
determine a recommended set. See the CESWG web 
site for references to such experiments.), more sophis¬ 
ticated treatments of scattering from moderator struc¬ 
tures (e.g., molecules in liquids, lattices for solids) are 
required. The excitation of modes of vibration, and 
thus energy loss to phonons must be considered. This 
has been a fertile field of development [9] and double 
differential scattering cross sections for various mod¬ 
erators have been produced. They are of the form: 

<r s (Ei E f , Qi -► Qf)dE f dQ f = (l/47ikT) 
(Ef/Ei) 1/2 exp(-/?/2)(7 es S(a,/?)dE f dQf, 

(35) 

where 

a = (Ei + Ef - 2(EiE f ) 1/2 )Qi • Q f /kT and 
P = (Ei + Ef)/kT. 

<j es is the scattering cross section of the bound “moder¬ 
ating” nucleus (e.g., proton, deuteron, carbon). S(oc,(3), 
the scattering law , embodies the physics of the influence 
of the moderator structure on the scattering process. 
Various formulations of S(a, (3) are tabulated as part of 
data files that document all the microscopic cross sec¬ 
tions that are used in fission reactor design. These files 
can be found on the web site of the National Nuclear 
Data Center (currently, nndc.bnl.gov). The most widely 
used set is ENDF/B, the latest (2009) version is VII.0. In 
order, however, to produce the differential scattering 
cross sections (Eq. 35) for design calculations, material 
temperatures, T, must be identified and supplied with 
the corresponding ENDF files to NJOY [10], a system 
of computer programs which produce microscopic 
cross sections for use in various design programs 
(which will be discussed in the next section). 



Fission Reactor Physics 


F 


3801 


The remaining neutron reactions of interest all 
involve the formation of a compound nucleus, which 
will be in an excited state, (X A+1 )*, and will subse¬ 
quently decay, yielding y or y's (neutron capture), 
n (elastic neutron scattering), n+y (inelastic scatter¬ 
ing), two ns (an (n,2n) reaction), p or a (charged 
particle production) or fissioning. The probabilities of 
these various outcomes for a given isotope, j, and 
incident neutron energy are characterized by the 
microscopic cross sections: a c j (E), a s j (E), a in j (E), 
a 2 n(E), cr p J (E), a a j (E), and a f j (E). As noted above in 
the discussion of Q factors, for endothermic reactions, 
Q < 0, cross sections will be zero until a threshold value 
of E for the initiating neutron is reached. This is the 
case for inelastic scattering, (n,2n) and some (n,oc) and 
(n,p) reactions. There is similar threshold behavior for 
fertile isotope fission cross sections (see Sect. Fission 
and Its Products), which when the reactions “go” are 
exothermic. All neutron capture reactions (n,y) are 
exothermic, and thus their cross sections are nonzero 
over the full range of fission reactor neutron energies. 

One can view the “compound nucleus reaction” 
cross sections as the product of a cross section for 
compound nucleus formation, a c (neutron capture 
by the target nucleus), times the probability of 
a particular decay mode of the excited compound 
nucleus. Both factors of this “product” depend on 
the nature of the target and compound nuclei, X A and 
X A+1 , and the energy available to excite the compound 
nucleus, X A+1 . The later is the sum of the reduced mass 
(i.e., center of mass) kinetic energy of the initiating 
neutron: 

E c = E(M x /(m n + M x )) ^ E(A/(1 + A)), (37) 

where X A is assumed to be at rest and momentum is 
conserved; and the excitation energy, E e (see Eq. 17), 
provided by adding a neutron to X A . E e is the binding 
energy of the “added” neutron in the compound 
nucleus. 

The magnitude of a c depends on the structure of 
X A . First, if neutron number N (=A-Z) is odd, a c is 
larger than its counterpart for neighboring isotopes 
with even neutron numbers. The opposite is true for 
N even. This just reflects the binding energy advantage 
of pairing half-integral spin Fermions in a nucleus (see 
the discussion of B 4 in Sect. Fission and Its Products). 
Second, for nuclei of various As there are Magic 


Numbers for both Z and N (2, 8, 20, 50, 82 and 126) 
which can be thought of as closing shells of protons and 
neutrons, analogous to atomic electron shells. The 
reduction of a c for a magic number N nucleus, relative 
to its N + 1 isotope neighbor’s a c > is much larger than 
the pairing effect. 

Excited compound nuclei have mean lifetimes, x 
(see Eq. 14) of as long as 10 -14 s (Kaplan), much 
longer than the transit time for a neutron crossing a 
target nucleus, ~2R/v. Given the nuclear diameter, 
2R ~ lO -12 cm, the transit time for even a thermal 
neutron is ^10 -17 s (The term thermal neutron refers 
to the most probable energy of the neutrons in thermal 
equilibrium in a zero-power reactor (e.g., a mockup). 
At 20°C, this is 0.023 eV, with a corresponding neutron 
speed of 2,200 M/s.). Thus, the standard assumption is 
that the decay of an excited compound nucleus is 
independent of all but the input energy of the initiating 
neutron. The decay modes of a particular excited state, 
nuclear level, are characterized by a level width 

r = h/(27rr), (38) 

with dimensions of energy (h is Planck’s constant, 
4.135667xl0 -15 eV-s), which is based on the 
Heisenberg uncertainty principle. In a quantum 
mechanical system like our excited compound nucleus, 
knowledge of energy and time is governed by 

AEAt - h/27i. (39) 

Thus, r can be viewed as the uncertainty in energy 
of an excited state (level) of a compound nucleus, and x 
a measure of the “uncertainty” of the lifetime of the 
excited state. The microscopic neutron cross sections, 
which go through the compound nucleus formation 
process, exhibit resonance behavior (peaking) when the 
neutron energy and E e (the added neutron binding 
energy) produce or nearly produce a well-defined 
excited state (i.e., having a small T). See Fig. 6. 

The level width T can be thought of as the proba¬ 
bility per unit time of decay of an excited state and thus 
the sum of partial “widths” (probabilities per unit 
time) for each mode of decay: 

r = ry + r n + r ny + Tf + r 2n + r p +..., (40) 

(where r n refers to elastic compound scattering 
and r ny refers to inelastic scattering, the rest being 
obvious). 
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Fission Reactor Physics. Figure 6 

A typical neutron capture cross section for an isolated 
(single) resonance whose width at half maximum is r, the 
total level width. r y is the partial width for gamma ray 
emission from the excited state (level). E c is the center of 
mass (reduced mass) energy of the initiating neutron 
(Duderstadt) 

Therefore, a “compound nucleus reaction” cross 
section near an isolated resonance is 

(r(n, i) = <T C (n)ri/r, for i = y, n, (ny), f, 2n, p, etc. 

(41) 

The functional form of a (n,i), its dependence on 
neutron energy, was derived with the principles of 
quantum mechanics by Brief and Wigner [11] in 
1935. Their “formula” for this simple case is 

<x(n, i) = (V/47t)r n ry [(e - e 0 ) 2 + (r/ 2 ) 2 ], 

(42) 

where X is the de Broglie wavelength of the neutron, 
h/(2m n E) 1/2 , and E 0 is the energy of the resonance 
peak. Figure 6 is an illustration of this “form” for i = y. 

Breit and Wigner’s most impressive derivation is 
more general than Eq. 42. 

First, they considered neutron energies beyond 
what has been found pertinent to fission reactors. 
When one accounts for conservation of angular 
momentum, the initiating neutron has classically 
a magnitude of angular momentum |L| equal to pb, 
where p is neutron momentum, (2m n E) 1/2 , and b is the 


displacement of the neutron path from a parallel axis 
running through the center of the target nucleus. In 
a quantum mechanical treatment, 

|L| = (/(/ + l)) 1//2 h/27T where / = 0,1,2, 3,... 

(43) 

Then, one can think of “b” as |L| (given by Eq. 43) 
divided by the neutron momentum, p, and if there is 
going to be a significant probability of a reaction 
with the target nucleus, “b” cannot be much larger 
than the target nucleus radius, r = A 1/3 (1.28 =b 0.05) 
x 10“ 13 cm. For this to be true for a large nucleus, for 
example, for U 235 , and for nonzero angular momen¬ 
tum (e.g., / = 1), the neutron would have to have kinetic 
energy > 6.6 MeV. For smaller nuclei the required 
energy would be greater. Given the spectrum of neu¬ 
trons in fission reactors, where most neutrons are born 
at around 2 MeV (see Fig. 4), an assumption of zero 
angular momentum (1 = 0) for the vast majority of 
reactions is good, and thus equation Eq. 42 does not 
include a factor involving angular momentum or spin 
quantum numbers. This assumption also means that 
decay products of an excited compound nucleus will be 
released isotropically in the center-of-mass coordinate 
system, which is reflected in the factor of 1/4 ti in Eq. 42 
(the probability that the decay product i (i = n, y, p) is 
released dQ about any H). The quantum mechanical 
treatment of angular momentum also accounts for the 
statement made above that “billiard ball” elastic scat¬ 
tering “can be assumed to be isotropic in the center of 
coordinate system.” This direct elastic scattering is 
referred to as potential scattering so as to be differenti¬ 
ated from resonance (compound nucleus) elastic scat¬ 
tering, that is, i = n in equation Eq. 42. 

Second, Breit and Wigner recognized and treated 
interference between potential and resonance elastic 
scattering. They found that the total elastic scattering 
cross section dips at energies right below the resonance 
peak, E 0 . 

Finally, as they were aware that there could be 
multiple possible excited states of a compound nucleus 
they extended their “formula” to two resonances whose 
Ts do not over lap. 

Since Breit and Wigner’s original work, there has 
been great activity in measuring cross sections, moti¬ 
vated principally the desire to understand the physics 
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of the nucleus. In the process, however, the basic 
parameters required for nuclear weapon and reactor 
design were generated. The neutron cross sections for 
fission reactor design are summarized in Table 6 [12]. 

In this table, the distinction is made between reso¬ 
nance cross sections with different densities (spacing) 
of resonance peaks. With intermediate and heavier 
nuclei the level structure grows more complex, and 
the number of possible excited states of a compound 
nucleus greatly increases. With higher neutron energy 
more finely spaced excited states can be reached and 
their level width, Ps, increasingly overlap until mea¬ 
surement cannot resolve individual resonances. 

In parallel with the work of nuclear spectroscopy 
experimentalists, theoreticians have built on Breit and 
Wigner’s work. Resonance cross-section models [13] 
are key to creating Evaluated Nuclear Data Files. 


The Cross Section Evaluation Working Group, 
a cooperative effort of national laboratories, industry 
and universities in the United States and Canada (see 
nndc.bnl.gov), sponsors reviews of the various mea¬ 
surements of a given cross section (target isotope and 
reaction) and the subsequent determination (As part of 
the process of evaluating nuclear data sets, very accu¬ 
rate calculations of integral experiments are made. 
Zero power mockups of reactors, with carefully 
recorded dimensions and inventories, are commonly 
used. Monte Carlo calculations (to be discussed in the 
next section) of neutron balance in the mockups are 
made with various data sets (e.g., cross-section librar¬ 
ies) to determine a recommended set. See the CESWG 
website for references to such experiments.) of a con¬ 
sensus set of parameters for an appropriate cross- 
section model. These models and their “consensus” 



Fission Reactor Physics. Table 6 Types of neutron cross section for various target element/isotope masses pertinent to 
fission reactor design 


Slow Epithermal Fast 

neutrons neutrons neutrons 

E < 1 eV 1 eV< E<0.1 MeV 0.1 MeV < E< 20 MeV 


Light 
nuclei 
A <25 


Intermediate 

nuclei 

25 < A < 80 


Heavy 
nuclei 
A <80 


Separated resonances 


Potential scattering 

Resonance scattering, (n, 2 n), ( n , p) 


Separated Overlapping Continuum 


resonances 

resonances 

resonances 

Resonance scattering, r 

adiative capture 

^1 






Potential scattering 


Inelastic scattering 


Separated Overlapping Continuum 


resonances 

resonances 

resonances 

Radiative capture 




Inelastic scattering, (n, 2 n) 
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parameters are a large part of the ENDF/B-VII.O data 
files. Having the cross sections represented by an ana¬ 
lytic model also facilitates dealing with the temperature 
effect on resonance cross-sections, that is, Doppler shift 
or broadening. The analytic process of averaging 
a resonance cross section (i.e., its model), over the 
velocity distribution of the target nuclei at a given 
temperature is similar to what was discussed above 
for reactions initiated by neutrons in thermal energy 
range. The process is outlined by Duderstadt and Ham¬ 
ilton using the single-level Breit Wigner formula as the 
resonance model. The effect of increasing temperature 
is to reduce a resonance peak while broadening its 
width, thus increasing its T. To first order, the area 
under the resonance is unchanged, which could led 
one to think that resonance “Doppler” broadening is 
not an import effect in a reactor application. This is 
true if the density of the resonance target nuclei is small 
(i.e., it is very dilute in the reactor), and thus its pres¬ 
ence does not change the energy dependence of the 
reactor’s neutron population. However, in most reactor 
designs, resonance absorbers are concentrated in local¬ 
ized reactor features (e.g., fuel elements, control rods) 
and there is significant self-shielding at the resonance 
peak. That is, neutrons with the “peak” energy will 
most likely be absorbed in the reactor “feature” 
irrespective of temperature-induced changes in the res¬ 
onance microscopic cross section. But the story can be 
different on the wings of a resonance where the cross 
section is much smaller, and, thus, so is the self¬ 
shielding. An increase in temperature of the “feature” 
can result in a net increase in neutron absorption, with 
no change at the peak energy, but with increases in the 
wings. This phenomena can aid in insuring a negative 
temperature coefficient for a fission reactor design 
('Temperature coefficients are collective reactor parame¬ 
ters that reflect how neutron balance is impacted by 
temperature change through feedback mechanisms, 
for example, Doppler broadening (or narrowing) of 
resonances, and moderator density changes. Reactor 
control will be discussed in Sect. Fission Reactor 
Performance.). A negative temperature coefficient is 
a crucial reactor design safety requirement. 

Dealing with temperature in producing resonance 
cross sections for design calculations is handled in 
a manner similar to the process for thermal energy 


range cross sections discussed above. The ENDF reso¬ 
nance cross-section models and appropriate material 
temperatures are input to NJOY, and the output are the 
broadened cross sections in the model format. How 
these cross sections are used in design calculations is 
addressed in the next section. 

As noted at the start of this section, gamma ray 
(photon) cross sections are important in fission reactor 
design as they are required for the full treatment of 
energy deposition throughout a reactor (in its fuel 
bearing core and supporting structures). In the order 
of importance with increasing gamma energy, the 
mechanisms of attenuation are the photo electric effect 
(y,e _ ), Compton scattering (y,y*), and pair produc¬ 
tion (y, e _ e + ). The photoelectric effect removes a 
y when its energy, hi), can eject an atomic electron. 
Its cross section is approximately proportional to 
Z 4 /(hu) 3 , and has discontinuities in energy as the ion¬ 
ization energy of various atomic electron shells are 
achieved. For higher y energies, Compton scattering 
interactions with atomic electrons can be treated as 
effectively free electron collisions. Conserving momen¬ 
tum and energy relativistically, one can derive expres¬ 
sions for energy loss and change in direction for 
initiating ys as a function of their incident energy. 
The magnitude of the cross section is proportional to 
Z. When y energies reach a threshold of 1.022 MeV 
(2 x m e c 2 ) and are in the field of a target nucleus, pair 
production of an electron and positron is possible. The 
magnitude of the pair production cross section is pro¬ 
portional to Z 2 . Of course, in tracking the y population 
in a reactor, one recognizes that annihilation of a pos¬ 
itron will produce two 0.51 MeV y’s. So pair produc¬ 
tion can be viewed as a form of inelastic scattering 
event. Cross sections for these three processes are tab¬ 
ulated in ENDF/B files, and they are described at a 
thorough but accessible level in the classic text by 
Robley D. Evans “The Atomic Nucleus.” 

An example plot of these cross sections for Th 232 is 
provided in Fig. 7. 

Finally, there are other gamma reactions which can 
take place in a reactor, for example (y,f) and (y,n) (the 
latter which we noted earlier as a source of delayed 
neutrons). However, these are threshold reactions for 
relatively high-energy gammas, and as shown in Table 3 
the total energy available from fission from fissile 
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Fission Reactor Physics. Figure 7 

Thorium cross section for the photo electric effect (y,e), for Compton scattering (y,y'), and pair production (y, e + e~) [32] 


isotopes for gammas is limited: <8 MeV for prompt 
y’s, and, <6.5 MeV for delayed y’s. Thus, these reac¬ 
tions are not important in determining the overall 
distribution of gammas in a reactor design. 

Neutron Distributions 

With the material provided to this point, the primary 
problem of reactor theory can be addressed: that of 
finding the neutron distribution in phase space 
(r,E,ft), of a reactor design, and subsequently the reac¬ 
tor’s power distribution, both throughout the reactor 
design’s lifetime. The first task is to derive the equation 
for the neutron density, N(r,E,ll), the neutron trans¬ 
port equation, and auxiliary equations for the atom 
densities, Pj(r,t), of depleting (initial inventory) iso¬ 
topes and important fission products (Like reactor 
physics, reactor theory is a traditional term in the 
nuclear engineering discipline. It refers to the study of 
the neutron transport equation and the means of its 
solution.). A simple approach to deriving the transport 
equation is to consider a balance relationship for 
N(r,E,ll,t) dEdQdt for a time invariant volume, V, in 
configuration space: 


dEdQdt J (9N(r, E,Q, t)/9t)d 3 r = 

V 

— loss from flow out of V 

+ # scattering into dE about E (44) 

and d Q about Q 

+ # produced by fission 

+ # produced by other sources 

For the “flow” term, one defines the neutron 
angular current J(r,E,il,t) = vN(r,E,lX,t), where 
|J(r,E,il,t)*n dsdEdQdt| is the no. of neutrons at r, 
with energies in dE about E, traveling in dQ about IX, 
which cross an area ds with a unit normal vector n in dt 
at t. And thus, net flow out of V, which has a non- 
reentrant surface S, is: 


dEdQdt 


• nds 


j J(r,E,n,t) 

r (45) 

dEdQdt / v • VN(r, E, Q, t)d 3 r, 


where Gauss’ Theorem is applied to transform the 
surface integral to a volume integral. 
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The “reactions” in V are simply: 
dEdQdt / vZ T (r, E, t)N(r, E, Q, t)d 3 r, 


(46) 


where Z T (r,E,t) is the total macroscopic cross section in 
V. All scattering cross sections included in Z T have been 
integrated over final energies and directions. 

Scattering into V is: 


oo 

dEdQdt J d 3 r J dE' j dQ' 


(47) 


v%(r, t, E' -► E,Q' -> Q)N(r, E',Q', t) 


where 2 S is the double differential macroscopic scatter¬ 
ing cross section in V (see the definition Eq. 65 for an 
example of a double differential microscopic scattering 
cross section). 

The direct fission source into V is: 


oo 

dEdQdt J d 3 rj dE' J dQ" ^ v pi (E' —> E) 


(48) 


2^(r,E',t)v , N(r,E',Q,t)/47t, 


where v p i (E r —> E) is the number of prompt neutrons 
emitted in dE about E by a fission of isotope i initiated 
by neutrons in dE' about E': the macroscopic fission 
cross section for isotope “i” is p i (r,t)a fi (E / ). 

The delayed neutron source into V is: 

dEdQdt ^Xdj(E)A f Cj(r, t)d 3 r, (49) 

j 71 V 


where X dj (E)/47i is the probability that the decay of 
delayed neutron precursor “j” will produce a neutron 
in dE about E and dQ about Q, and where Xj and Cj 
are, respectively, the decay constant (see the beginning 
of section “Heavy Elements”) and isotope density of 
precursor “j”. 

And, finally any source of neutrons in V not pro¬ 
duced by a neutron reaction is given by: 

dEdQdt^ d 3 rS(r, E, t)/47i, (50) 

V 

where S could characterize a source (e.g., Plutonium 
(238)-Beryllium(oc,n) or Califonium-252 (spontaneous 


fission)) included in a reactor design to aid in reactor 
start-up or S might account for decay processes yielding 
neutrons due to the presence of depleted fuel incorpo¬ 
rated from another design. 

Now, if the terms on the right side of the “balance 
relationship,” equation Eq. 44, are moved to the left 
side, and dEdQdt and the integral operation Jd 3 r is 

V 

factored out of all the terms, then as the right side 
is now zero and the small volume V is arbitrary, the 
collection of expressions under the integral must equal 
zero. The resulting equation is the Neutron Transport 
(or Boltzmann) Equation: 

2-N(r,E,Q,t) = —v» VN(r,E,Q,t) 

-vE r (r,E,t)N(r,E,Q,t) 

OO 

+ J dE' J d£l'v% 

o 

(r,t,E' -»E,Q' -► Q)N(r,E',Q',t) 

OO 

+ J dE 1 J dQV^Vpi(E'^E) 

o 1 

Z fi (r,E , ,t)N(r,E',Q',t)/47t 

+ ^-Xdj( E )AdjCj (r,t)/4jt 

j 

+ S(r,E,Q,t). 

(51) 

The conditions for solutions of this partial- 
differential-integral equation are the continuity 
condition: 

N(r + ocfi,E,Q,t) must be a continuous function of 
a for r + afl in the reactor, and the 

boundary condition: 

N(r s ,E,ft,t) = 0, for < 0, where n is an outward 
unit vector normal to a non-reentrant surface cho¬ 
sen to define the extent of the reactor. 

The auxiliary equations for number densities of 
delayed neutron precursors, Cj(r,t), and fission 
product poisons, depleting fissile isotopes, and burn¬ 
able poisons, Pi(r,t)s, are simply defined as movement 
of these isotopes in space can be ignored. Burnable 
poisons are elements with large neutron absorption 
cross sections (e.g., Boron, Hafnium, Cadmium, 
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Erbium, Gadolinium) that can be included in reactor 
designs to maintain neutron balance over design 
lifetime. 

• For delayed neutron precursors: 


equations may be needed to deal with transmuta¬ 
tion of one isotope of a burnable poison to another, 
which has a significant neutron capture cross sec¬ 
tion (for example Hafnium, which has four natu¬ 
rally occurring isotopes). 


J“Q(r, t) = —Ai Q(r, t) 


+ j dE' I do' 

0 j 

v'E fj (r,E',t)N(r,E , ,fi , t), 


(52) 


where ^ is the expected number of precursors of 
type i produced by fission of isotope j. 

• For fission product poisons (e.g., Xenon-135 and its 
precursors Telirium-135 and Iodine-135, and 
Samarium-149 and its precursors Neodimium-149 
and Promethium-149): 


d 

dt 


Pi( r > t) 


= — Ai/?i(r, t) + /l Hi pj(r,t) 


OO 

~ J dE 1 J dQVi(r,t)<7 c i 

0 

OO 

+ J dE' J dQ'^ 

o k 

f)v'N(r,£',Q',t), 

fk 



(53) 


where Yik(E) is the expected number of poison (or 
poison precursor) nuclei produced by fission of 
isotope k induced by neutrons of energy E. 

• For fissile fuel and burnable poison isotopes: 


^ Pi( r i t) = - J d£ ' J d£2'ft(r, t)ff a i(E')v' 

0 ' ' 
N(r, E',Q', t), 

where the subscript “a” as applied to a a x conven¬ 
tionally refers to capture plus fission for fissile iso¬ 
topes. For reactor designs containing fertile 
isotopes, equation Eq. 54 will have a source term 
reflecting the transmutation process leading to the 
fissile isotope £C i” (see Fig. 1). Additional “auxiliary” 


Solving these “auxiliary” equations, irrespective of 
their number, is not a calculational challenge, given 
one knows the neutron density, N(r.E,fl,t), as they are 
ordinary differential equations. Obviously, solving the 
neutron transport equation (Eq. 51) for N(r,E,Xl,t) is 
another matter. There are several features of fission 
reactors, however, that make this task more tractable. 
First, the density of neutrons needed to produce as 
much power as can be removed/transferred from 
various reactor types to do useful work is very small, 
~ 10 7 —10 9 #/cm 3 , where as the density of nuclei is many 
orders of magnitude larger 10 23 #/cm 3 . Therefore, 
the neutrons can be viewed as a very dilute gas in the 
matrix of a reactor’s nuclei, and thus neutron-neutron 
collisions can be ignored (they have not been 
accounted for in Eq. 51) (Another neutron behavior 
that can be ignored in formulating the transport 
equation is the finite lifetime of a free neutron. Its 
mean-life is ~11.5 min. But, as will be discussed in 
the next section, the lifetime of neutron in a reactor is 
measured in milliseconds.). Second, in the primary 
nuclear design calculations, where it is determined 
if a trial configuration, loading and geometry, of 
fuel, structure, moderator, coolant, control elements, 
and poisons, can sustain neutron balance through 
out the reactors lifetime objective, the time variable 
“t” in Eq. 51 can be treated in a much simplified 
manner. Given the initial conditions of a trial reactor 
configuration, it is a good assumption that the atom 
densities in the various macroscopic cross sections in 
Eq. 51 can be treated as nearly constant for a significant 
time interval, At > minutes. With this assumption, 
and no neutron-neutron collisions, Eq. 51 is linear in 
N(r,E,fl,t) during At, and solution methods for Eq. 51 
are greatly simplified. In addition, for the interval, At, 
Eq. 51 can be treated as a time-independent equation. 
For a primary nuclear design calculation, one ignores 
the source term S(r,E,t) (its importance to the 
start-up problem will be addressed in the next section) 
and Eq. 51 becomes a linear homogeneous eigenvalue 
problem: 
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vfi • V • N(r, E,£2) + v£ T (r, E)N(r, E,£2) 

OO 

-v J dE' J dfiVZ s (r,E'->E,n'^fi)N(r,E',n') 
0 

oo 

= l -jd E' J dQ'v'J^M^) 


Vpj(E' -4 E) + £x di (E)f u lN(r,E',£>')/47T 


(55) 


where 1/k is the eigenvalue. It has been customary to 
use the inverse of cc k” as the eigenvalue and to refer to 
k as the multiplication factor. Note that if Eq. 55 is 
integrated over the reactor volume and E and il, 
then k is equal to the ratio of neutron production to 
neutron loss, this is the origin of its designation as 
a “multiplication factor.” A further simplification in 
Eq. 55 arising from the assumption of “time indepen¬ 
dence” during At, is that delayed neutron production 
can just be added to prompt neutron production. As 
indicated, previously delayed neutrons, though less 
than a percent of total neutron yield in a fission, are 
critical to transient reactor behavior, and, therefore, 
control system design, to be covered in the next section. 

Before proceeding with the solution methods for 
Eq. 55, a description of how the primary nuclear design 
calculation proceeds is required for a basic understand¬ 
ing of reactor design, and to provide perspective on the 
utility of the various solution methods for the transport 
equation. Assuming the solution method chosen has 
yielded a ki and Ni (r,E,ft) for the initial time interval, 
Ati where i = 1, then one proceeds to check design 
requirements, and make needed modifications to the 
reactor’s initial trial configuration. In general, this is an 
iterative process including other disciplines (i.e., heat 
transfer and fluid flow, structural analysis). If k 1, 
inventories of fuel/poisons or the positioning of 
control elements will need to be altered to achieve 
neutron balance (Control elements (sometimes generi- 
cally called control rods) are structures incorporating 
highly neutron absorbing isotopes. They have a dual 
role in fission reactor design: (1) assuring criticality (a 
controlled steady state chain reaction) throughout 
a reactor’s design lifetime, i.e., compensating for poten¬ 
tial excess neutron production from the initial fissile 
loading which must be large enough to accommodate 


depletion; and (2) providing safe shut down (termina¬ 
tion of the chain reaction) of the reactor in case of an 
accident condition or during a planned interruption of 
normal operations (e.g., for plant maintenance or 
refueling). The same “structures” could accomplish 
both functions or there could be separate structures 
(sometimes referred to as “shim” and “shutdown” rods 
respectively).). The power distribution throughout the 
reactor must then be determined. As N(r,E,fl) is the 
solution to a homogeneous equation, its absolute mag¬ 
nitude is undetermined, but a normalization factor, p, 
can be established from the total thermal power rating, 
P(MWth), requirement of the design, that is from: 

dE / dJ2v^6jS j5 (r ) E)N(r ) E,Q) ) 

RVol. 0 ] 

(56) 

where Sj is the energy release per fission of isotope j (see 
Table 3). Now given pN(r,E,il), one can calculate the 
power distribution throughout the reactor 



oo 

P(r )=p /dE/ dQv^ 8j ^(r,E)N(r,E,Q), 


(57) 


and determine peak powers in fuel elements, and aver¬ 
age and peak heat fluxes into coolant channels. Thus, 
fuel element and heat removal system limits can be 
checked. If there are violations, the trial configuration 
must be altered and new results for k and N(r,E,li) 
found. From the power distribution and subsequent 
thermal analysis, one can also verify the temperatures 
that were assumed for the trial configuration , that is, the 
temperatures that were needed to define thermal scat¬ 
tering cross sections and Doppler broadened resonance 
cross sections. Finally, when all conditions are met for 
the first time interval, At i? where i = 1, the auxiliary 
equations (Eq. 53 and Eq. 54) can be solved to update 
inventories of fuel and poisons for the next time inter¬ 
val, At 2 . For a thermal reactor design, where the fission 
product poisons Samarium-149 and Xenon-135, are 
important, initial time intervals should be short 
(minutes) until equilibrium levels of these isotopes 
are achieved (^hours). Subsequent time intervals can 
be many hours. From this brief description of the 
primary nuclear design process , it should be clear that 
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having accurate and efficient solution methods for the 
time-independent neutron transport equation is key to 
achieving a successful design. 

There are two basic approaches to solving the 
time independent neutron transport equation, Eq. 55; 
the probabilistic-statistical Monte Carlo simulation 
method [14] where individual neutrons are traced 
through their life experience in a reactor, and analytic 
methods where the variables, r, E, and (sometimes) H, 
are made discrete, thus transforming Eq. 55 into 
a matrix equation. 

Monte Carlo simulation is in principle well suited 
for this application because the neutron density is low 
so the transport equation can be treated as linear. Thus, 
each “experiment,” that is, neutron history, is indepen¬ 
dent of all others. Initially, a neutron is started at a 
randomly selected reactor location and with a ran¬ 
domly selected direction, il. Its initial energy is 
selected by treating a typical prompt neutron energy 
spectrum as a probability distribution: 

Emax 

P(E)dE = v p (E)dE/ J dEv p (E), (58) 

E min 

which is then “sampled” with a random number 
between 0 and 1. In Monte Carlo computer programs 
[15, 16] a sequence of random numbers is generated 
with an algorithm. To provide an example of “sam¬ 
pling,” note that in this case, given a random number n, 
the “sampled” starting energy E is found by simply 
solving the transcendental equation: 

E 

dE / P(E / )forE. (59) 

Emin 

Now having a speed and direction, one can “sam¬ 
ple” (with a new random number) the probability that 
the neutron travels a distance x before having 
a collision, using equation Eq. 20. As this probability 
depends on the total macroscopic cross section, 
Z T (r,E), one must keep track of material boundaries. 
An initial sampling will be for the distance from the 
starting point of the neutron to the first material 
boundary it could cross. If the initial sampling results 
in the boundary being crossed, then there is a new Z T 
and a second sampling is performed to determine if 


another boundary is crossed. Eventually, either the 
neutron leaves the reactor or the location of the first 
collision is established. As the total cross section is the 
sum of capture, scattering, and possibly fission macro¬ 
scopic cross sections for the various isotopes present, 
one can treat the relative magnitudes of the compo¬ 
nents of the sum as a probability distribution, which 
when “sampled” leads to the next step in our neutrons 
history. If capture is the result, the history ends, just as 
it would end if the neutron leaked (escaped) from the 
reactor. Either capture or leakage is recorded as a “loss.” 
If fission is the result, the history also ends, but the 
number of neutrons produced (1, 2, or 3) in the fission 
of the “selected” isotope j, and the fission location are 
recorded. The number of fission neutrons is deter¬ 
mined by “sampling” the probability distribution for 
fission yield of isotope j. The mean of the distribution 
Vj (E) includes delayed neutrons. The number of neu¬ 
trons produced by a “history” is counted as “produc¬ 
tion.” If scattering is the result of the collision then the 
double differential macroscopic cross section is treated 
as probability distribution and “sampled” to provide 
a new energy and direction for the neutron. Then, the 
process of tracking to either escape or the next collision 
is repeated. Thus, the “history” proceeds until it ends as 
either “loss” or “production”, if “production” a starting 
point for a subsequent “history” and a location for 
energy deposition are also provided. 

There are various strategies for carrying out Monte 
Carlo calculations and the evaluation of the statistical 
nature of the results. A standard approach, in outline, is 
to run successive groups of “histories” (thousands). 
Discard the first few groups, which are used to establish 
a reasonable fission neutron spatial source distribution, 
and then find the mean and standard deviations of the 
desired calculational results from the subsequent 
groups. It is most economical to get good statistics for 
the eigenvalue k, the multiplication factor, which is just 
“production” over “loss,” quantities which are accumu¬ 
lated over the whole reactor. Energy deposition, that 
is, power distribution, results are much more costly. 
Hundreds of groups with millions of histories per 
group would be required to give good statistics (a five 
percent 95% confidence interval) for the number of 
fissions in small reactor volumes (e.g., a 1 cm length 
of a typical PWR fuel element which has a volume 
of 0.7 cm 3 , out of a reactor volume of 32.8 x 10 6 cm 3 
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(for a 3,400 MWth rating)). From this discussion, one 
can see why, as mentioned in section Cross Sections, 
Monte Carlo calculations of mock-up experiments are 
widely used in cross-section data set evaluations, where 
the results of interest are changes in k, the multiplica¬ 
tion factor. Even with the tremendous advances in 
computing capability which have been made to date, 
Monte Carlo simulation is not as yet the main line 
method for primary nuclear design calculations. But, 
as will be seen in the following description of analytic 
methods, it can greatly aid in improving the accuracy of 
the analytic methods. 

When the analytic approach of making the neutron 
density’s variables r, E and Q discrete is applied, so as to 
make the computational errors in solving the transport 
equation comparable to an exhaustive Monte Carlo 
simulation, the computer resource requirements 
will challenge today’s largest machines (peak speeds 
of ~2.3 x 10 15 flops (floating point operations per 
second) [17]). 

This can be demonstrated with the large PWR used 
in the Monte Carlo discussion: First , a spatial mesh of 
65.6 x 10 6 points would result, assuming quarter core 
radial symmetry, and from using a 1 x 1 x 2 cm 3 cell 
for averaging cross sections. (The mesh may need to be 
finer for highly absorbing features, e.g., fixed poisons, 
control elements, and can be courser in homogeneous 
regions.) Second , the energy variable can be treated 
with a multigroup approximation where the energy 
range, 0.0 —> 10 MeV is divided into intervals (groups), 
for example, 24 for thermal neutrons, 0.0 —> 0.625 eV, 
and 57 for the rest of the range, with most of these 
groups allotted to epithermal neutrons, E = 0.625 eV —> 
0.1 MeV where there is a concentration of explicit 
resonance cross sections (see Table 6). A weight func¬ 
tion f g (r,E), normalized for E to unity, is required for 
each interval, g. These can be calculated with infinite 
medium problems representing various portions of the 
reactor; homogeneous problems with no spatial vari¬ 
ables, or an infinite repeating array of cells (e.g., a fuel 
element and surrounding coolant) with radial spatial 
variables, but no axial, z, dependence. The power of the 
multigroup approximation is that it is insensitive to the 
choice of weight functions and thus simplifying 
assumptions can be made in selecting and solving the 
“infinite medium” problems to generate the f g s. 


Finally ; the direction variable, IX, can be dealt with 
through a discrete ordinate approximation [18]. The 
unit sphere is divided into segments, the surface areas 
of which sum to 471, and a direction vector XX n is 
assigned to each segment. The segment surface areas 
act as weight functions when the transport equation 
is integrated over a segment to yield an equation for the 
neutron density going in the direction XX n . There are 
various schemes for selecting ordinates and weights, 
the most widely known is the S n method. All methods, 
however, can produce ray effects if “n” is too small [19]. 
The channeling of neutrons into a few restricted direc¬ 
tions can produce anomalous results in reactor designs 
with localized neutron source regions, that is, where 
fissile and fertile fuel predominate in different regions 
(commonly referred to as seed-blanket designs). For 
a “highly accurate” treatment, one should let n = 16 in 
each octant, for a total of 128 ordinates. So for the 
“large PWR example,” the number of unknowns to be 
solved for in the discretized time-independent trans¬ 
port equation is 170 x 10 9 (=16.4 x 10 6 (spatial mesh 
points) x 81 (energy groups) x 128 (ordinates)). This 
is clearly a formidable calculational problem. If we view 
the analytic approach described here as transforming 
Eq. 55 into a matrix eigenvalue equation, then the 
simplest solution method of matrix inversion would 
involve matrices of a billion by a billion. Hence, an 
iterative method is required [20]. Much effort in reac¬ 
tor theory has been devoted to this problem, and to 
simplifying the analytic approach. Iterative methods 
for solving matrix equations are beyond the scope of 
this entry, but to understand how fission reactor design 
is actually carried out, a description of analytic 
approach simplifications is needed. 

The direction variable, IX, received the earliest atten¬ 
tion. Because the neutron population in a reactor can be 
viewed as a dilute gas, it was natural to assume that the 
variation of the neutron density in space could be approx¬ 
imated by V® D(r,E,t)VvN(r,E,t) (from Fick’s Law of 
diffusion). When the transport equation, Eq. 51, is 
integrated over IX and the first term on the right- 
hand side is replaced by the Fick’s Law expression, the 
result is the time-dependent neutron diffusion equa¬ 
tion. Equation 55 can be treated analogously to yield the 
time-independent neutron diffusion equation. In either 
case, the limitations of the diffusion approximation 
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only become apparent in trying to define the diffusion 
coefficient D(r,E,t) (see Henry, Nuclear-Reactor Anal¬ 
ysis). The most commonly used expression is 

D(r, E, t) = l/{3p t (r, E, t) - ^E s (r, E, t)]}, 

(60) 

where is the average of the average cosine of the 
scattering angle (in the laboratory coordinate system) 
of each isotope making up Z s (r,E,t). This definition 
(Eq. 60) arises from a low-order spherical harmonics 
expansion of N(r,E,lX,t) in the neutron transport equa¬ 
tion. Spherical harmonics are a complete orthonormal 
set of special functions on the unit sphere defined by Q 
(the unit vector which is at angle 0 from the z axis and 
projects in the x-y plane at angle cp from the x axis); 

Y™(Q) = H™P™(p)e im ^ for / = 0,1, 2, 3. 

m = — / < m < /, 

(61) 


where 
Hf = 


(2/+l)(l—m)! 
47i(/+m)!dm 

associated 


1/2 


, Vf(n) = sin m (n)-^V,(n) is 
the " associated Legendre Polynomial, 
P i(fi) = i(p 2 — 1)* is the Legendre Polynomial 
and p = cos0. The spherical harmonics are normalized 
by the relationship: 


2n 1 

J dip j dpYf{p,ip)Y^(p,ip) = 8 mm i6 ll i, (62) 

0 -1 

where Yf 1 is the complex conjugate of Yf 1 and the 
Kronecker deltas, 5, are 1 for m = m and / = V and 0 
otherwise. 

The “low-order” spherical harmonics expansion, 
which yields the diffusion approximation, is for l = 0 
and 1 (also referred to as the PI approximation). The 
four functions in the expansion are: 

Y° 0 = (1/4tt) 1/2 , Y- 1 = (3/47t) 1/2 sin 6e~ i(p , 

= (3/47r) 1//2 ^ and Yj = — (3/An) 1 ^ 2 sin 9e~ 1<p . 

(63) 

In a Cartesian coordinate system, the expansion 
coefficients (for simplicity of notation the time- 
independent case will be treated) are N(x,y,z,E), the 


neutron density and Nf 1 (x,y,z,E), N* (x,y,z,E) and 
N® (x,y,z,E), which when multiplied by v (the speed 
corresponding to E) are the neutron currents in the x, y, 
and z directions. Before applying the PI expansion 
to Eq. 55, one needs to note that the double differ¬ 
ential scattering cross section, Z S (E’ —► E,1T —> fl) 
in reactor applications (where neutron polarization 
and Bragg scattering can be ignored) depends on 
£T*£l. Then, given the addition theorem for spherical 
harmonics, 

pi(q'.q) = £ 

(64) 

Z s (r,E’ -> E, can be expanded in terms of 

associated Legendre polynomials. This is done in two 
steps: first the double differential scattering cross sec¬ 
tion is expressed as an expansion in ordinary Legendre 
polynomials (which are a complete orthogonal set of 
functions) 

2 S (r, E' -> E, Q' • fi) =£ s (r, E) V ^ 2l+1 ^ 

U 471 ( 65 ) 

F 1 (r,E , ^E)P 1 (Q'.Q), 

and then Eq. 64 is substituted for P^fT#!!). (It should 
be remembered that Z s is a macroscopic cross section, 
and a more complete notation would show a sum 
over contributions from each isotope present in d 3 r 
about r.) 

Now when the PI expansion for N(x,y,z,E,p,cp) is 
inserted in Eq. 55 and the resulting equation is, in turn, 
multiplied by the complex conjugate of each of the 
four spherical harmonics functions of the PI expan¬ 
sion (Eq. 63), and integrated over p ( — 1 —> 1) and 
cp (0 —» 27c), four equations result: The first is 

^-vN~ l (x,y,z, E) +2_vN|(x,y, z,E) 

+ ^ N J( x ,y, z ,E) = — v£ T (x, y, z, E)N(x, y, x, E) 

lOMev 

+ J dEV2 s (x,y,z,E')F 0 (E' -> E)N(x,y, z,E') 

0 

+ (the fission term in Eq.55 with N(r, E^Q') 
replaced with N(x, y, z, E')). 



( 66 ) 
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The remaining three equations are of the same 
form, one is provided here: 


1 d 

-v-N(x,y,x,E) + vS T (x,y,z, E)Nj(x,y,z, E) 

lOMev 

= J dE'v2 s (x,y,z,E')Fi 
0 

(E ; —» E)Nj(x,y, z, E') 


(67) 


With these four coupled partial-differential- 
integral equations, there is still considerable computa¬ 
tional complexity. To get to the standard neutron dif¬ 
fusion equation, one additional approximation is 
made: 


F^E' —> E) = <5(E' - E)/i(E / ), (68) 


where /i( E') is the average cosine of the scattering angle 
in the laboratory coordinate system, and energy loss 
(or gain) in the non-isotropic component of scattering 
is ignored. Substituting Eq. 68 in Eq. 67, the relation¬ 
ship between and N is a simple partial differential 
equation: 


N°(x,y,z,E) = - 


3 (S T (x, y, x, E) - /z(E)E s (x, y, z, E)) 


J“N(x,y,z,E), 


(69) 


containing the diffusion coefficient D(r,E) (see the 
definition Eq. 60). Given the equations of the same 
form as Eq. 69 for N} and Nj" 1 , and substituting all 
three into the left-hand side of Eq. 66. The left-hand 
side becomes the standard Fick’s Law expression: 

— V •D(r,E)VN(r,E) = .... (70) 

As noted above, this derivation of the diffusion 
approximation reveals its limitations. The ability of 
some combination of four low-order spherical har¬ 
monic functions, Eq. 63, to describe the true angular 
distribution of the neutron density throughout 
a reactor will be limited to regions where the distribu¬ 
tion is nearly isotropic, that is away from boundaries 
and highly absorbing features (control elements and 
fixed poisons). To address these limitations, special 
boundary conditions are used, and subsidiary 


calculations (to be discussed below) are made to pro¬ 
vide “fitted” cross sections for highly absorbing 
features. 

To make the diffusion approximation an efficient 
design tool, additional simplifications have been devel¬ 
oped. The differencing of the energy variable as 
described above for “multigroups” can be extended to 
a “few group” approximation. Again, weight functions 
are generated using accurate solutions to small region 
“cell” problems, which model repeating features of 
a reactor. But here the weight functions are applied 
over much larger energy ranges, three or four to cover 
the energy range of 0 to 10 MeV. Furthermore, one 
accepts the error associated with the weight functions 
not perfectly representing the spatial variation of 
neutron density energy dependence. 

The use of “cell problem” auxiliary calculations can 
be extended. As the core, the central fuel bearing 
region, of most reactor designs is made up of collec¬ 
tions of mostly fuel elements, and possibly some fixed 
poison and movable control elements, assembled into 
modules. One can perform highly accurate (e.g., Monte 
Carlo or multigroup, fine spatial mess, discrete ordi¬ 
nate) calculations for two-dimensional (radial) repeat¬ 
ing arrays representations of a core’s various modules. 
Then, for each module type, a series of corresponding 
few-group diffusion approximation calculations can be 
carried out, in which key few group cross sections 
are adjusted (“fitted”) to match reaction rates from 
the “highly accurate” reference calculation. One can 
the use these fitted cross sections in a full core three- 
dimensional few group diffusion calculation as part of 
the principle design process. 

Of course, with depletion, as inventories of fuel, 
fission products, and poisons are updated, fitting cal¬ 
culations will have to be repeated. This approach is 
particularly suited to the design of thermal reactors, 
that is, PWRs and BWRs, where the proper treatment 
of epi-thermal and thermal neutrons, including the 
effects of self-shielding by fuel and poison inventories, 
is crucial. 

In fast reactor design, where most fissions take place 
at energies above explicit resonances (liquid metals or 
gasses are coolants, and fissile and fertile densities and 
inventories are high (no effective moderators are pre¬ 
sent)) and the mean free path of fission neutrons is 
large ( 10 cm.), the treatment of fuel, structure and 
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coolant as an homogenized material, in formulating 
macroscopic cross sections, is a reasonable assumption. 
Treatment of the energy variable is also simpler as 
up-scattering is not important. As a result, multigroup, 
3D discrete ordinate calculations can be used in the 
principle design process for fast reactors. 

Finally, returning to thermal reactor design, there is 
another analytic calculational approach, which is in 
wide use, that should be mentioned. The general des¬ 
ignation is Nodal Methods. There are a number of 
variations under this title [21], but they all have as 
their starting point solving 2D module array problems. 
One needs to produce few group neutron distributions 
within each module. Each module (or depending on 
symmetry each half or quarter module) will be 
a “node.” Then coupling coefficients between nodes, 
both radially and axially, are generated. (It is predom¬ 
inately in defining coupling coefficients that the vari¬ 
ous “methods” differ.) The resulting nodal equations 
can be solved with modest computer resources, but, 
to obtain power distributions and to update invento¬ 
ries the full reactor neutron distribution must be 
constructed from the module solutions and the weights 
found for each node. Nodal Methods have been found 
to be particularly useful in applying the primary nuclear 
design process to evaluating refueling options for com¬ 
mercial (large scale) PWRs and BWRs, where partially 
depleted modules are relocated, “shuffled,” as new 
modules are added and fully depleted, “spent” modules 
removed during periodic refuelings. The computa¬ 
tional economy of nodal methods also allows them to 
be applied to fully time-dependent problems, particu¬ 
larly for accident analyses. The nature of these prob¬ 
lems will be discussed in the next section. 

Fission Reactor Performance 

In the previous section, the focus was the derivation 
of the neutron transport equation, and how it is 
solved in carrying out the primary nuclear design 
process. This quasi-static process involves a series of 
time-independent calculations of the neutron density, 
N(r,E,fl), and ultimately results in the configuration 
and inventories (loadings) that meet design require¬ 
ments for lifetime (total energy production), and nor¬ 
mal operation thermal performance (fuel element 
burn-up within limits and sufficient coolant flow 


provided by the design pumping power allocation). 
There are, however, additional design requirements 
that involve transients, that is, N(r,E,fl,t), which will 
be the subject here. 

The simplest approach to treating transient reactor 
behavior is through a “point” kinetics model. If one 
first multiplies the time-dependent transport equation, 
Eq. 51, by a weight function, W(r,E,H), and integrates 
over the reactor volume, energy (the full range, 0^10 
MeV), and direction (cos 0 from—1 to 1, cp from 0 to 
2 tt); and second, multiplies the time-dependent 
equation for each delayed neutron precursor, Eq. 52 
for Q(r,t), by W(r,E,H)X di (E) and performs the same 
integration over reactor volume, energy, and direction, 
the results are the point kinetics equations: 

^ - ^T(t) + f A,C,(t) + Q(t) (71) 

^^ = ^T(t)-A j C j (t) for j = 1,2,...I (72) 

where T(t) is an amplitude function; 

lOMev 

T(t) = J dV J dE J dQW(r,E,Q) 

reactor 0 

N(r, E, Q,t). 

In formulating the expressions for the kinetics 
parameters, p(t), (3(t), and A(t), it is convenient to 
factor the neutron density into a product of “shape” 
and amplitude functions. The shape function is; 

S(r, E, Q, t) = N(r, E, Q,t)/T(t). (74) 

Now the weight function is defined over the same 
domain (space, energy, and direction) as the neutron 
density, and thus from the definitions Eq. 73 and Eq. 74 
the normalization of S and W follows: 

lOMev 

J dV J dE J dOW(r,E,£l)S(r,E,Q,t) =1 for all t. 

reactor 0 

(75) 

In order for the point kinetics equations to provide 
accurate solutions for small changes in reactor config¬ 
uration, the weight function, W(r,E,12), is chosen to be 
the solution the adjoint equation corresponding to the 
time-independent transport equation, Eq. 55, for the 
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reactor of interest adjusted to be critical (i.e., the eigen¬ 
value k = 1). In the adjoint of Eq. 55, the variable pairs 
(E,I1) and (E\£l J ) are interchanged in the scattering 
and fission terms. The solution, N*(r,E,il), is referred 
to as the adjoint neutron density or the importance 
function. The latter name indicates the physical inter¬ 
pretation of N*. If the reactor described by Eq. 55 is at 
zero power (no neutrons) and a neutron is inserted at r 
with velocity v(E,H), the neutron level will increase to 
a steady-state value (remember the reactor is still crit¬ 
ical). This “level” per neutron added at (r,E,li) is N*(r, 
E,fl). How N*, acting as a weight function, improves 
the point kinetics equations will be discussed after p, (3, 
and A are defined and their physical interpretation 
given. To simplify notation, the scattering and fission 
integral operators in Eq. 51 are represented by A and G: 

(X) 

A= J dE 1 J dQVE s (r,E' —> E,Q' —> Q)», (76) 

0 

(X) 

G = y’dE , y’dQV^r i ( E')£fi(r,E>, (77) 

o 1 

where Vi(E’) is the total number of neutrons (prompt 
plus delayed) produced by fission of isotope i induced by 
a neutron with energy E’. In reactor kinetics, delayed 
neutron yields are expressed in terms of the ratio the 
number produced with a given half-life (i.e., a member 
of “delayed group,” j, as noted previously, see Table 4) by 
fission of isotope, i, to the total the total yield, Vf. These 
ratios are represented as /?j, where normally j = 1,2 ... 6. 

The parameter p(t) is the reactivity of a reactor and 
is a measure of how far from criticality (a steady-state 
chain reaction only from fission neutrons, no other 
neutron sources present) the reactor is at time t. This 
can be seen from the expression for p(t) that results 
from the derivation of Eq. 71 from the transport equa¬ 
tion (where the functional dependencies on r, E, H, 
and t are understood for Z T , W, and S): 

oo 

/ dV/dE/dQW[ — vQ*VS—vEjS 

reactor 0 

+ AS + ^X i (E)GS] 

p( t) =-4^-, 

/ dV /dE /dQW y X‘(E)GS 

reactor 0 i 

( 78 ) 


where X x (E) is the total fission spectrum for isotope i: 

X'(E) =Xj/E){l —/T} + y^X<jj(E)/j. (79) 

j=l 

Now note, if both the numerator and denominator 
of Eq. 78 are multiplied by the amplitude function T(t), 
and W is taken as 1, then the numerator is the 
total neutron production rate minus loss rate for the 
reactor. (When the first term in the bracket in the 
numerator is integrated, and Gauss’ theorem is applied, 
it yields the total neutron leakage rate from the reac¬ 
tor.) Similarly, the denominator is the total neutron 
production rate. Reactivity is a dimensionless parame¬ 
ter whether or not W is unity. If it is zero, the reactor 
is critical, if negative, subcritical and if positive, 
supercritical. 

Pj in Eq. 72 is the effective delayed neutron fraction 
for the jth precursor group, and P in Eq. 71 is the sum 
of the Pj’s: 


oo 

/ dV / dE /dQW y X i (E)i?jGS 

P; EE —-^---• (80) 

f dVfdEf dQWy X‘(E)GS 

reactor 0 i 

As with the expression for reactivity, multiplying 
the numerator and denominator of Eq. 80 by T(t) and 
letting W = 1, one sees that in this case Pj is the fraction 
of total fission neutrons produced in the reactor by 
precursor group j. 

The parameter A is the prompt neutron lifetime and 
is defined as: 


/ dV fdE f dQWS 

A =-^-. (81) 

f dV fdE f dQWj]X i (E)GS 

reactor 0 i 

Again multiplying the numerator and denominator 
by T(t) and taking W = 1, one sees that A equals 
the number of neutrons in the reactor divided by the 
rate of neutron production by fission. If the reactor is 
critical, Eq. 81 has the same form as the “fundamental 
equation of radioactive decay,” Eq. 11, and A can be 
thought of as the “mean lifetime” of a neutron born 
into the reactor. In the point kinetics equations, A is 
the mean prompt neutron lifetime , and the timing 
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of the appearance of delayed neutrons is treated 
explicitly through the behavior of their precursor, 
Cj(t) j = 1,2 .... I (usually = 6). 

The derivation of the point kinetics equations 
directly from the time-dependent neutron transport 
equation has been presented here to provide perspec¬ 
tive on approximations that are normally made to 
make the generation of the point kinetics parameters 
(p, (3, and A) practical. If Eq. 51 and its auxiliary 
equations could be readily solved for N(r,E,il,t), 
there would be no need for the point kinetics equa¬ 
tions. As it turns out, however, “practical” approxima¬ 
tions follow from the approaches described in the 
previous section for solving the time-independent 
transport equation. Henry in Nuclear-Reactor Analysis 
derives the point kinetics equations starting with the 
diffusion approximation (with energy a continuous 
variable). One could just as well start with a few 
group diffusion approximation which would provide 
a shape function (a vector) and from the adjoint of the 
few group diffusion equation, a weight function (also 
a vector). The advantage of using an adjoint weight 
function, irrespective of the approximation to the 
transport equation one starts from, is in calculating 
reactivity, p(t). In transients of interest in design, p(t) 
is the driver. It reflects changes in cross sections with 
time due to variations in temperature (coolant density, 
Doppler effects) and configuration (control rod 
motion). In whatever form Eq. 78 takes, given the 
neutron transport approximation used, if a changing 
cross section is represented as a starting value plus 
a time-varying small delta, 5s(r,E,t), and the shape 
function is represented as a time-independent function 
(e.g., from the initial neutron density of the reactor, 
the same problem that generates the adjoint) plus 
a time-dependent small delta, 8S(r,E,t), then the 
resulting calculation of p(t) will to first order in deltas 
only depend on 5x(t)‘s. Higher order terms can be 
ignored, and one does not need to calculate a time- 
dependent shape function. This is a classic perturba¬ 
tion problem. Henry (chapter 7) provides a detailed 
derivation. 

The time dependence of (3j(t) and A(t), Eq. 80 and 
Eq. 81, as used in the point kinetics equations can be 
ignored in most applications. Measured values of (3 
(and the from which it is summed) can be used 
directly (Table 4). If adjoint weighting is used, the (3’s 


will be a bit larger than the physical (3 £ s in a thermal 
reactor due to the increased “importance” of delayed 
neutrons with their lower initial energies (relative to 
prompt neutrons). Prompt neutron lifetimes primarily 
depend on the reactor type; for thermal reactors they 
are on the order of ~ 10 -3 s, and for fast reactors as 
short as 10 -7 s. They can be measured in zero power 
reactor mock-up experiments as ratios with (3 and p, or 
calculated directly from equation Eq. 81 with the 
approximations for W and S used to obtain p. 
A highly accurate calculation of A can be made with 
a Monte Carlo simulation where neutrons are intro¬ 
duced from the prompt neutron energy spectrum with 
an S(t = 0) spatial distribution. Each history would be 
timed and terminated with neutron absorption (cap¬ 
ture plus fission) or leakage. The “times” will yield the 
mean and standard deviation of the prompt neutron 
lifetime. 

One further note on reactivity, if a perturbation of 
a critical reactor configuration can be viewed as nearly 
instantaneous, that is, a step change, then a good esti¬ 
mate of reactivity addition or subtraction can be found 
by solving the eigenvalue (time independent) problem 
for the perturbed reactor: 

P = 1 - 1/k, (82) 

and if the initial reactor configuration is subcritical 
then the reactivity addition from a “step” perturbation 
can be found by performing two eigenvalue problems, 
the perturbed case as before, and one for the initial 
subcritical configuration (ignoring any nonfission 
source); 

P = 1/ko - 1/k, (83) 

where k 0 (<1) is the initial subcritical eigenvalue. As 
reactivity is a dimensionless ratio, it is often given as 
a percentage or in units of (3 (as defined by Eq. 80). In 
the latter case, the “units” are traditionally dollars and 
cents. If p equals (3, the reactivity addition is 1 dollar; 
if p equals 0.5(3 the reactivity addition is 50 cents. 
If a dollar of reactivity is added to a critical reactor, 
it is said to be prompt critical (critical on prompt 
neutrons alone) and delayed neutrons will not mitigate 
the resulting transient, a condition obviously to be 
avoided. 
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The motivation for the description of point kinetics 
provided here is best provided by Henry: 

► "Since mean free paths are fairly long and since the 
lifetimes of neutrons in a reactor are quite short, the 
effects of local perturbations on" N(r,E,fl,t) "will quickly 
spread throughout a reactor. The immediate conse¬ 
quences of perturbing a reactor locally (for example 
by changing a control rod slightly) is thus a 
readjustment of the shape of the" neutron density. In 
many cases this readjustment is slight and is com¬ 
pleted in a few milliseconds; after that the readjusted 
shape rises or falls as a whole depending on whether 
the initial perturbation increases or decreases k eff . For 
reactors in which transients proceed in this manner, 
merely being able to predict the change in level of the 
neutron density "is sufficient to permit a very accurate 
prediction of the consequences of perturbation." 

With today’s computer capabilities, solving point 
kinetics problems is not a great challenge, even with 
time-dependent reactivity reflecting feedback from 
power changes in the reactor. Henry and Duderstadt 
provide descriptions of applicable calculational 
methods (development of which inspired great inge¬ 
nuity in the past). In any case, to quote Henry again, 
point kinetics solutions provide “very accurate predic¬ 
tions of the consequences of (reactor) perturbations.” 
Thus, their utility in assuring that a reactor design 
satisfies fundamental transient requirements. Under 
normal operating conditions, a reactor must be inher¬ 
ently stable, that is, self-limiting. Reactivity must be 
reduced with increased temperature; that is, with 
reduced coolant density, increased mean thermal neu¬ 
tron energy, and Doppler broadening of resonance 
cross sections. These phenomena are dependent on 
reactor type. Clearly change in thermal neutron spectra 
is unimportant in a fast reactor. However, if coolant 
density decrease results in voiding, reactivity will dra¬ 
matically decrease in a thermal reactor, but in a liquid 
metal cooled fast reactor increased leakage must out¬ 
weigh a higher energy neutron spectrum (and an 
increased fission to capture ratio in fuel) to assure 
a negative reactivity effect. Also movement of control 
rods must reduce reactivity when reactor shutdown is 
desired. Some movable poisons, power shimming con¬ 
trol rods, could be included in a design to flatten (make 
more uniform) the power distribution throughout life. 


(Power flattening can reduce coolant pumping power 
requirements. As coolant flow must meet the heat 
removal needs of the hottest region of the reactor, 
minimizing excess flow to cooler regions increases 
overall power plant efficiency.) But, one must assure 
that the operating strategy for using such rods does not 
compromise the speed of reactor shutdown when it is 
required to deal with an accident condition. 

Point kinetics models can also aid in assessing 
Xenon override requirements for thermal reactors. 
Xe 135 with its extremely large thermal neutron capture 
cross section (a c = 2.7 x 10 6 b at E = 0.023 eV, for 
comparison a f U235 = 577 b at 0.023 eV) is the most 
important fission product poison. It is produced 
directly from fission and by decay of its precursor I 135 
(which is a direct fission product and has short-lived 
precursors, Sb 135 Te 135 -► I 135 ). Both Xe 135 and I 135 

have half-lives measured in hours (T 1/2 Xe = 9.14 h, 
T \/2 = 6.58 h). So, a point kinetics reactor model will 
show that when a reactor is started up, Xe 135 and I 135 
build up to equilibrium levels in about 30 h. Their 
levels, inventories, will depend on power level, that is, 
neutron density, and the reactor design must have 
enough excess reactivity (e.g., control rods that can be 
withdrawn, or for a PWR, a soluble poison in the 
coolant whose concentration can be reduced) to “over¬ 
ride” the negative reactivity perturbation of neutron 
capture by Xe 135 (iodine neutron capture can be 
ignored). In addition, when a thermal reactor is shut 
down, Xe 135 builds up as loss by neutron absorption 
stops and decay of I 135 continues. The Xe 135 inventory 
peaks in about 10 h to approximately three times its 
equilibrium level and subsequently decays. It is back to 
its equilibrium level in ~ 40 h (see Fig. 8). 

Clearly additional “excess reactivity” is required to 
deal with a post-shutdown Xenon transient. Eventually, 
not being able to provide (and control) this excess 
reactivity can limit the useful lifetime of a thermal 
reactor design. As a historical aside, it was John 
Wheeler [3] who recognized the role that Xe 135 and 
I 135 could play in the operation of a thermal reactor. 
He explained the initial difficulties in operating the 
first plutonium production reactor at the Hanford 
Washington site. 

While point kinetics can deal with most reactor 
design transient requirements, there are instances 
where significant spatial effects must be accounted 
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Fission Reactor Physics. Figure 8 

Schematic of I 135 and Xe 135 inventories following an initial 
reactor start-up and subsequent shutdown after 
equilibrium levels have been reached (i.e., at t 0 ). 
(Duderstadt) 

for. In the normal operation of a large thermal reactor, 
there is the potential for spatial oscillations of Xenon 
concentrations, and therefore neutron density and 
power. These oscillations may not affect criticality and 
might only be observed by the reactor instrumentation 
system’s ability to measure power distributions in core. 
The period of these oscillations would be many hours 
and thus limits on coolant channel performance and 
fuel temperature could be compromised for extended 
time intervals. Space-time calculations of neutron den¬ 
sity have to be carried out to determine if a particular 
design has a propensity for these oscillations. If the 
design is not inherently stable, its instrumentation 
must provide detection and an operating strategy 
must be devised to suppress any oscillation initiation. 
With modern computing capability, few group 
diffusion approximations to the time-dependent 
transport equation, Eq. 51 (with explicit spatial mesh 
or nodal methods), can deal with Xenon oscillation 
evaluations [22]. 

There are certain reactor plant accident scenarios, 
which also require space-time calculations. To deal 


with these, nodal methods, as described in section 
“Neutron Distributions”, have been incorporated in 
safety analysis programs (see for example [23]), which 
model a full power plant; the reactor, its neutronics, 
fluid mechanics and structural integrity; and the bal¬ 
ance of the plant, instrumentation, coolant/working 
fluid to power conversion, and safety systems (contain¬ 
ment, emergency coolant injection and power supply). 
There are two classes of accidents where space-time 
effects in the reactor are important. First is “rod ejec¬ 
tion,” where a single control rod or group of control 
rods is rapidly withdrawn with a large reactivity addi¬ 
tion and neutron distribution change, in no way a small 
perturbation in the point kinetics sense. Second is 
a “cold water accident” applicable to a PWR. In this 
case, in a plant with multiple piping “loops” carrying 
coolant to the reactor, if one of the loops is not func¬ 
tioning so that water in the loop has cooled below the 
normal inlet temperature of the reactor, and the loop is 
reactivated (its pump turned on and isolation valves 
(if any) opened), while the reactor is critical, there can 
be large asymmetric reactivity insertion, again, this is 
not a small perturbation problem. 

Finally, there is an additional aspect of reactor neu¬ 
tron time variation, which has interesting Physics. The 
transport equation, derived in section “Neutron Dis¬ 
tributions”, is for the mean neutron density, N(r,E,H,t), 
but clearly as neutron interactions and production, and 
fission product decay are inherently probabilistic, there 
are fluctuations in the neutron population in a reactor 
(and as there are in delayed neutron precursor 
populations). These fluctuations were recognized 
early on in the Manhattan Project and are also referred 
to as neutron noise [24]. There are several approaches 
to modeling of the phenomena. The most fundamental 
is based on the derivation of the neutron transport 
equation from the quantum Eiouville equation 
(Osborn and Yip [25]). This derivation is extended to 
produce an equation for the neutron doublet density, 
N NN (r,E,H,r’,E’,fl’,t), the expected number of neutrons 
in d 3 r about r, with energies in dE about E, going in the 
solid angle dQ about times the expected number in 
d 3 r about r’, and so on for E’ and where as in the 
transport equation neutron-neutron collisions are 
ignored. Additional equations for neutron-precursor 
and precursor-precursor doublet densities are pro¬ 
duced to complete the set of equations needed to 
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solve for N nn . One also needs equations for the “mean” 
(in Osborns nomenclature, singlet) neutron and pre¬ 
cursor densities, which were derived in section “Neu¬ 
tron Distributions” (Eq. 51 and Eq. 52). Being able to 
solve for N nn is, however, not sufficient to predict the 
results of neutron “noise” experiments in a reactor. 
Equations for doublet and singlet densities for events 
(D) in detectors (e.g., pulse or continuous currents in 
an ionization chamber) are also needed. With N DD and 
N d one can predict variance to mean ratios of counts or 
the power spectral density of the current in a single 
detector, and the cross power spectral density of cur¬ 
rents in two detectors [26]. These experiments are 
usually performed in a reactor in a steady-state condi¬ 
tion (in the mean of course) at zero power, critical or 
slightly subcritical, to obtain estimates of point kinetics 
parameters. These experiments have the advantage of 
verifying expected kinetic performance without put¬ 
ting the reactor into a transient. They are performed 
at low neutron levels because as power is increased 
fluctuations become negligible relative to the mean. 
Detector noise measurements (psd and cpsd) are some¬ 
times made at power in operating reactors to monitor 
for unplanned mechanical motion, or loose parts. 
Modeling for these measurements is deterministic. 

Fluctuations in neutron populations must be con¬ 
sidered in developing initial start-up procedures for 
a newly constructed reactor (In a reactor design which 
incorporates fuel (including fission products and trans- 
uranics) from a previously operated reactor, natural 
source levels will most likely be high enough to allow 
“fluctuations” to be ignored.). The mean neutron den¬ 
sity before start-up depends on an external source of 
neutrons, S(r,E) (not from neutron reactions, Eq. 50) 
which, as part of the design, could be adjacent or 
internal to the reactor. At start-up, the reactor is sub- 
critical (k < 1) and the mean neutron population, in 
a point reactor sense, is N = AS(k/(l-k)). In outline, 
the steps to bring the reactor critical are to pull control 
rods up (down in most BWR designs) from their fully 
inserted position so as to insert some precalculated 
amount of reactivity and then wait for a new steady 
neutron level to be achieved. The subcritical neutron 
level is monitored by the reactor’s source range detec¬ 
tors. A power reactor is instrumented with detectors for 
the full range of expected neutron levels. These pull- 
and-wait steps are repeated until the reactor is slightly 


super critical, and thus the neutron level is observed to 
be on a continuously increasing, but easily controlled, 
trajectory. A pull-and-wait procedure needs to account 
for neutron level fluctuations because if the observed 
level, due to a minimizing fluctuation, is below the 
expected (mean) level when the reactor is actually 
close to its critical configuration, then the next “pull” 
might produce an unacceptable rapidly increasing tra¬ 
jectory [27]. The simplest way to avert a problem with 
a pull-and-wait procedure is to assure that the sources 
provided in the design (In a reactor design which 
incorporates fuel (including fission products and trans- 
uranics) from a previously operated reactor, natural 
source levels will most likely be high enough to allow 
“fluctuations” to be ignored.) are strong enough to 
render subcritical neutron fluctuations negligible. Of 
course, one has to understand neutron level fluctua¬ 
tions to make this assessment [28]. 

Future Directions 

Fission reactor development, since its inception, has 
progressed with advances in computing capability. 
Early on, analog computers were used for transient 
analyses, but digital computers have been the primary 
tool. Design codes have been adapted to advancing 
digital technology; scalar processing, then vector 
processing and now massively parallel processing. 
This is an active field today and should continue to be 
so, particularly as the drivers for improve computer 
technology are universal. In section “Neutron Distri¬ 
butions” the two approaches to solving the neutron 
transport equation, Monte Carlo simulation and ana¬ 
lytic methods (differencing variables and solving the 
resulting matrix equations) were described. Parallel 
computing would appear to be well suited to Monte 
Carlo as independent histories can be run on the var¬ 
ious (1,000’s of) processors simultaneously. There is, of 
course, the need to provide the reactor configuration 
(geometry, nuclide inventories, and cross sections) and 
the Monte Carlo code itself to each processor that runs 
a history. This challenge is being accepted with consid¬ 
erable success as exemplified by the accomplishments 
of the Los Alamos National Laboratory group working 
on the MCNP code [15] and the joint effort at the 
Knolls and Bettis Atomic Power Laboratories on the 
MC21 code [29]. Advocates of the analytic approach 
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have, however, not accepted the ultimate triumph of 
Monte Carlo. This is clear in the work of a group at 
the Argonne National Laboratory, which has modestly 
named their multigroup, discrete ordinate code UNIC, 
for Ultimate Neutronic Investigation Code [30]. They 
are demonstrating impressive results for fast reactor 
designs. Competition in supercomputer development 
and in attendant codes for nuclear reactor design bodes 
well for better products in the future. 

Physicists in their efforts to understand the atomic 
nucleus have made myriad measurements and only 
partially by design these have included the neutron 
and gamma cross sections, and fission product yields 
(and their decay mechanisms) needed for the develop¬ 
ment of fission reactors. Today, work on this reactor- 
related data is focused on establishing well-founded 
uncertainty measures. The Cross Section Evaluation 
Working Group of the National Nuclear Data Center 
refers to this effort as covariance evaluation [31]. This 
is particularly appropriate, for as discussed above, one 
expects calculational methods to improve with com¬ 
puter power and code development. Thus, in assigning 
error bounds in design, the uncertainty in basic data 
will become more important relative to the contribu¬ 
tion of calculational error (e.g., Monte Carlo statistics or 
differencing and convergence error in analytic methods). 
More well-founded and hopefully smaller design error 
bounds can obviously be taken advantage of in future 
reactor development. Improving error bounds is also 
consistent with the approach to overall power plant 
safety analysis being fostered by many of the world’s 
nuclear regulatory agencies. They favor best estimate 
analyses plus the assignment of rigorously defined uncer¬ 
tainty factors for various classes of accident conditions. 
Reactor design error is only a contributor to a safety 
analysis “uncertainty factor,” but for power plant tech¬ 
nology to advance, “reactor design” must do its part. 

Much of the research for a next generation of fission 
reactor power plants is focused on higher operating 
temperatures. Today’s thermal reactor plants, PWRs, 
BWRs, and CANDU (heavy water moderated) have 
outlet reactor coolant temperatures, T H > of ~600°F, 
and thus thermal (Rankine cycle) efficiencies in the 
low 30%. Raising T H would increase cycle efficiency 
and lower fuel cost, and provide high-temperature 
process heat (possibly for a catalytic hydrogen produc¬ 
tion process). Higher operating temperatures in a water 


(or heavy water) environment present core and struc¬ 
tural materials challenges. There likely will be a need for 
additional resonance cross-section data for some addi¬ 
tives (e.g., Manganese and Chrome) to new high- 
temperature materials. Fast reactors (liquid metal or 
gas cooled) operate at much higher temperatures than 
thermal reactors but also require much higher fissile 
inventories to attain criticality. Experience with design 
and operation of these reactors is limited (especially 
for gas cooled reactors) compared to thermal reactors. 
Their future development with emphasis on the burn¬ 
ing of unwanted transuranics as well their traditional 
mission of efficient conversion of fertile isotopes (U 238 
and Th 232 ) to fissile isotopes (Pu 239 , Pu 241 and U 233 ) 
will stimulate some cross-section work. But, both ther¬ 
mal and fast reactor development will most likely ben¬ 
efit more from advances in branches of physics other 
than Nuclear, particularly Condensed Matter and Fluid 
Mechanics. The need for nuclear power, both economic 
and environmental (if they can be separated?), will 
drive fission reactor development. While Uranium 
and Thorium are abundant in the earth’s crust, power 
demand will push reactor development to most effi¬ 
ciently exploit the highest grade ores, which will likely 
lead to a mix of fast and thermal reactors. In any case, 
physics as well as engineering will play key roles in this 
development. 
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Glossary 

Cementitious Said of material that has self-cementing 
properties in the presence of water alone. 
Cenospheres Known as “microspheres” that are small, 
lightweight, inert, hollow spheres filled with air 
and/or gases, comprised mainly of silica and alu¬ 
mina, produced from coal combustion process like 
fly ash, and used widely mainly as high-quality, 
lightweight fillers. 

Coal A sedimentary rock of organic origin, consisting 
mainly of carbonized plant remains and containing 
some other elements, such as sulfur, hydrogen, and 
nitrogen, and mineral impurities. 

Coal combustion by-products (CCPs) Residues gen¬ 
erated from the combustion of pulverized coal, 
which include fly ash, bottom ash, boiler slag, and 
flue gas desulfurization (FGD) materials. 

Coal mineral matter Material in coal derived from 
minerals present in the original plant materials 
that formed the coal, or from external sources 
such as sediments deposited with the plant remains 
or precipitates from interstitial water. This mineral 
matter converts to ash during coal combustion. 
Concrete A construction material produced from 
a mixture consisting primarily of aggregates (sand 
and crushed stone or gravel), Portland cement, and 
water. Certain coal ashes can be used as a replacement 
for a portion of the Portland cement. 

Pozzolan Siliceous or aluminosilicate materials in fine 
powdery form that, in the presence of water, will 
chemically react with the calcium hydroxide 
released by hydration of Portland cement to form 
compounds possessing cementitious properties. 
Supplementary cementing material A material that, 
when used in conjunction with Portland cement, 
contributes to the properties of hardened cement 
through its hydraulic or pozzolanic activity, or both. 

Definition of the Subject 

Fly ash consists of the inorganic matter within coal that 
has been melted at high temperature during coal com¬ 
bustion, solidified while suspended in the flue gases, 
and collected by electrostatic precipitators or other 
devices before the flue gases are released into the atmo¬ 
sphere. Fly ash is the lighter, less dense part of coal 
ash, consisting of small spherical particles having 


pozzolanic or some cementitious properties. It is dis¬ 
tinguished from the heavier, denser part of coal ash, the 
bottom ash, which is collected at the bottom of the 
boilers and consists of coarse nonspherical particles 
that lack pozzoanic/cementitious properties. Fly ash, 
also known as pulverized fuel ash (PFA) because of its 
fine powdery form and pozzolanic and cementitious 
nature, has many established applications in the pro¬ 
duction of concrete-related construction and pavement 
materials. 

Introduction 

The pulverized coal firing system for electricity gener¬ 
ation was developed in early twentieth century in the 
USA, and since then, the system has been adopted in 
more than 50 countries. As pulverized-coal-fired gen¬ 
erating stations were constructed worldwide, increas¬ 
ing quantities of coal combustion by-products became 
available. Fly ash is the most abundant coal combustion 
by-product and the amount generated has increased 
dramatically during the past few decades. It has become 
apparent that simply discarding such large quantities of 
this material would be environmentally unacceptable 
and methods for its reuse have been developed for 
years, especially in Europe, the USA, India, and 
China. Nevertheless, there is a great variation in usage 
among countries, from total reuse to almost complete 
disposal. Efficient, environmentally sound, and cost- 
effective handling of fly ash is a continuing challenge 
for major fly ash producers, marketers, and regulators 
worldwide. This entry provides useful resources and 
a description of the generation, specifications, manage¬ 
ment, and utilization of fly ash produced by pulver¬ 
ized-coal-fired power plants. 

Fly Ash Production and Rate of Utilization 

A schematic flow diagram of fly ash production in 
a typical pulverized-coal-fired power plant equipped 
with an electrostatic precipitator for fly ash collection is 
shown in Fig. 1. 

In a coal-fired power plant, pulverized coal is 
burned in a boiler under high temperature and the 
heat generated converts water to steam for electricity 
generation. The noncombustible mineral matter in 
coal (typically 10-25 wt% in US coals and 30-50 wt% 
in the coals of India) is converted to coal ash. Fly ash, 
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Fly Ash. Figure 1 

Fly ash generation in a pulverized-coal-fired power plant equipped with a dry-bottom boiler, an electrostatic precipitator, 
and a flue gas desulfurization (FGD) device 


the lighter part of coal ash, is suspended in the flue gas 
and is collected before release of flue gas into atmo¬ 
sphere. Fly ash collection systems include electrostatic 
precipitators, baghouses, fabric filters, or mechanical 
devices such as cyclones. The electrostatic precipitator 
is the most common and efficient of these; it can 
remove about 99.9% of the fly ash in the flue gas. 
Using a dry-bottom pulverized-coal-fired boiler, fly 
ash accounts for 75-85% of the total coal ash produced 
[1] and the remainder is the heavier, denser part, the 
bottom ash, which is collected at the bottom of the 
boilers (Fig. 1). The solid waste materials derived 
from coal combustion are known as “combustion 
residues” or “coal combustion by-products (CCPs).” 
If a pulverized-coal-fired power plant is equipped 
with a flue gas desulfurization (FGD) system for burn¬ 
ing high-sulfur coal, another type of CCP, known 
as FGD residues, is also produced. The amount of 
all CCPs being generated has increased dramatically 
around the world during the past few decades as 
the number of coal-fired power plants increased 
and as pollution regulations forced plants to reduce 


their emissions. However, fly ash is the most abundant 
among these CCPs. 

Organizations have been created in many countries, 
usually with the support, or collaboration of their gov¬ 
ernments, for the purpose of minimizing landfill and 
maximizing the utilization of CCPs. They provide 
information and assistance to those who are interested 
in developing or using coal combustion by-products in 
value-added applications. These organizations also col¬ 
lect statistical data, and analyze trends in the produc¬ 
tion and utilization of CCPs and their disposal 
practices. These organizations include, but are not lim¬ 
ited to, the American Coal Ash Association (ACAA), 
the European Coal Combustion Products Association 
(ECOBA), the Ash Development Association of 
Australia (ADAA), the United Kingdom Quality Ash 
Association (UKQAA), and the Fly Ash Utilization 
Program (FAUP) in India. For a more comprehensive 
list see “Useful Resources” at the end of this entry. 

ACAA compiled data on production and usage of 
CCPs in the USA as early as 1966. They indicate that the 
fly ash production rate in the USA increased from 
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about 59.4 million tons in 1996 to about 68.1 million 
tons in 2001 and to 72.4 million tons in 2008. ACAA’s 
figures use short ton (1 short ton=2,000 lb = 0.9072 
metric tons). The amount of fly ash recycled into com¬ 
mercial products also increased from about 22% in 
1996, to about 32% in 2001, to about 42% in 2008. 
More than half of the remaining fly ash produced in 
2008 was still deposited in landfills or storage ponds [2] 
at a cost to the utility companies and thus to the 
consumers. 

Typical coals in India have ash contents ranging 
from 30 to 50 wt% [3]. During 1999-2000, the utility 
industry in India produced about 70 million tons of 
coal ash per year, but only about 10% of the ash was 
used in ash dike construction and land filling, about 
3% was used in other construction products, and the 
remainder was disposed of in landfill [4] . In other more 
developed countries, 80% or more of the fly ash gen¬ 
erated was used in manufacturing concrete, bricks, and 
cellular concrete blocks, in soil-stabilization treatment 
and land fill applications, in agriculture, in recovery of 
metals and cenospheres, and in dam construction. 

Realizing coal ash posed a significant environmen¬ 
tal problem, in 1999, the government of India set up the 
“Flyash Mission” in New Delhi under its Department 
of Science 8c Technology with a goal of coordinating 
ash utilization efforts and aiming at 100% fly ash uti¬ 
lization [3] . A law was approved by Indian government 
in 1999 [5] requiring every construction agency engaged 
in the construction of buildings within a radius of 50 km 
(amended to 100 km in 2003 [6] ) from coal-fired power 
plants to use only fly ash-containing construction 
products, and those manufacturing fly ash construc¬ 
tion products should contain at least 25 wt% fly ash. 
During 2004-2005, when the utility industry in India 
generated about 112 million tons of fly ash per year, the 
FAUP (Fly Ash Utilization Program) in India has 
succeeded in increasing fly ash utilization from 1 mil¬ 
lion tons per year in 1994 to about 42 million tons per 
year in 2005 [7]. However, the remaining 70 million 
tons of fly ash still ended up being deposited in ash 
ponds or lagoons that covered more than 26,000 ac of 
arable land [8]. 

In China, there are incentive measures to encourage 
utilization of fly ash, especially in the concrete and cement 
industry. For example, the company that produces 
cement products with more than 30% of fly ash will 


receive tax deduction. Some areas of China such as Bei¬ 
jing, Shanghai, and Hubei, recycle 100% of their fly ash. 
The national figures from China Energy Development 
report indicated that total fly ash production in China 
increased from 148 million tons in 2000, to 229 million 
tons in 2004, and is predicted to reach 378 million tons in 
2010. The fly ash usage rate also increased from 58% in 
2000 to 68% in 2004. Similar to other major fly ash- 
producing countries, the remaining huge amount ended 
up in ash ponds [9] . 

Many individual scientific papers have described 
the production and usage rates of fly ash from around 
the world. Dhadse and coworkers published a review 
paper on fly ash characterization, utilization, and gov¬ 
ernment initiatives in India [10]. In their paper, the fly 
ash production and usage rates around 2005 in various 
countries were compared (Table 1), with data from 
TIFAC (Technology Information, Forecasting and 
Assessment Council) in India http://www.tifac.org.in. 
As indicated in Table 1, the US figures differ from those 
reported by the ACAA, and the figures for China differ 
widely from those in the China Energy Development 
report described earlier in this section [9]. The ACAA 
report indicates that in 2005, the USA produced about 
71.1 million (short) tons of fly ash and only about 41% 
of it was used. Nevertheless, Table 1 provides 


Fly Ash. Table 1 Fly ash generation and utilization in 
different countries (printed with permission of the Journal 
of Scientific & Industrial Research [10]) 


Country 

Annual ash production, 
MT 

Ash utilization, 

% 

India 

112 

38 

China 

100 

45 

USA 

75 

65 

Germany 

40 

85 

UK 

15 

50 

Australia 

10 

85 

Canada 

6 

75 

France 

3 

85 

Denmark 

2 

100 

Italy 

2 

100 

Netherlands 

2 

100 
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a reasonably accurate worldwide comparison in fly ash 
production and usage. These data indicate that fly ash 
production in different countries varied from 2 to 
112 million tons with the major producers being 
India, China, and the USA. The fly ash utilization rate 
varied widely from 38% to 100%, with the total fly ash 
being recycled in the countries of Denmark, Italy, and 
the Netherlands. 

Worldwide production and consumption of coal 
for energy has increased dramatically, especially in 
recent years, in countries such as India and China. 
This is because the total world reserves for coal (250- 
1,500 years at current consumption rates) are the most 
abundant, cheap, and easily produced energy resource, 
compared to the reserves for natural gas (70-120 years) 
and crude oil (40-60 years) [11]. The cost of using coal 
for electricity generation may become more expensive 
in the future when regulations for green house gas 
emissions are implemented. Currently, coal provides 
more than half of the US electricity supply and will 
continue to be the fuel of choice for worldwide power 
generation for a couple decades to come [11]. There¬ 
fore, managing coal fly ash in an efficient, environmen¬ 
tally sound, and cost-effective manner will remain 
a continuing worldwide challenge. 

Fly Ash Management 

Figure 1 shows fly ash produced in a typical pulverized- 
coal-fired utility boiler and collected by an electrostatic 
precipitator. There are two commonly used methods 
for removing the fine powdery fly ash from the fly ash 
hopper of the precipitator - a wet method or a dry 
method [12-15]. 

In the wet method, water is used to flush the fly ash 
out of the hoppers and the ash slurry is pumped 
through pipes to large, man-made settling ponds in 
the vicinity of the power plant. This method was widely 
used because it was the cheapest method for removing 
and storing fly ash. Utility companies that use an ash 
pond for fly ash disposal have three basic options 
available when the ponds are full: (1) constructing 
another ash pond nearby; (2) excavating the fly ash 
from the pond and transporting it to a land fill; and 
(3) raising the edge of the existing pond by 
constructing a dike to allow disposal of more fly ash 
in the pond. 


In the dry method, vacuum suction or high- 
pressure compressed air is used to fluidize and trans¬ 
port the dry fly ash to storage silos, from which it can be 
either reused or disposed of in a dry or wet form. Most 
silos are equipped with fluidizing capability for easy 
discharge of the fly ash through their bottom outlets. 
Fly ash in a dry form can be directly discharged into 
enclosed containers, tanks, or wagons [13]. Typically, 
a little water is added to the dry fly ash to allow 
stockpiling or transporting in a conditioned form 
(about 15 to 30% moisture) in trucks or rail cars. The 
main advantage to conditioning the fly ash is to control 
the dust that maybe released during transporting of the 
material. 

In areas where there is an established fly ash market, 
coal-fired power plants typically would install a dry fly 
ash handling system to allow beneficial reuse of the fly 
ash and avoid disposal in impoundments. In areas, 
with limited or no markets for fly ash, the expense of 
installing and operating dry collection systems has been 
difficult to justify if the space needed for ponding or 
wet stacking is available near the power plant. 

Finding a beneficial application is the preferred 
method for managing fly ash because it provides 
a way to convert fly ash from a liability to a value- 
added raw material. Also, the final products containing 
fly ash, in general, are more durable and of better 
quality than their conventional counterparts. Environ¬ 
mentally, use of the fly ash can reduce land disposal 
requirements and reduce the use of other natural 
resources, such as the lime or clay that the ash replaces. 
Economically, recycling can reduce the utility’s costs 
associated with fly ash disposal and may increase rev¬ 
enues for both fly ash producers and users by using fly 
ash to replace higher cost conventional raw materials. 
The performance benefits may vary depending on the 
properties of the fly ash and the type of application. For 
example, the largest single use of fly ash worldwide is in 
concrete/cement products (see discussion in section 
“Applications”), and fly ash-based concrete products 
or pavements have been found to have better engineer¬ 
ing properties and longer service life than conventional 
concrete products. 

Although power companies produce ash at their 
coal-fired power plants, most power companies do 
not handle the disposing or marketing of the ash they 
produce [16]. Commercial ash marketing companies 
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operating in many countries have been playing increas¬ 
ingly important roles in fly ash management. These 
companies are able to maintain a product with consis¬ 
tent chemical and physical characteristics, a factor that 
is extremely important, especially for concrete/cement 
product applications. Some companies have facilities 
for research and development, and facilities for fly ash 
characterization and processing. They can blend differ¬ 
ent fly ashes, or remove unburned carbon from a high- 
carbon fly ash to meet concrete specifications. Some 
companies provide coal-fired power plants with com¬ 
plete on-site ash handling and management, including 
the installation of dry fly ash collection systems and the 
environmental and engineering services. Power plants 
that use ash marketers to sell their fly ash may employ 
an ash marketing specialist whose responsibility is to 
monitor ash generation and quality and interface with 
ash marketing companies or brokers who are under 
contract to the utility company. 

There are about 40-50 commercial ash marketing 
companies operating throughout the USA, in all states 
with an exception of Hawaii [ 16, 17] . Some commercial 
ash marketing companies are listed in the “Useful 
Resources” at the end of this entry. 

Fly Ash Properties and Classification 

Figure 2 shows photos of small quantities of a bitumi¬ 
nous coal fly ash (Class F) and a subbituminous coal fly 
ash (Class C) highlighting their powdery form. 



Fly Ash. Figure 2 

Photos of small piles of bituminous coal fly ash (Class F) 
and subbituminous coal fly ash (Class C). Photos courtesy 
of the Illinois State Geological Survey 


Fly ash consists of fine, powdery particles with 
diameters ranging from <1 to 150 pm. The particles 
are predominantly spherical in shape and mostly glassy 
(amorphous, noncrystalline) in nature, because the 
heat generated during coal combustion is high enough 
to cause partial melting of many of the minerals present 
in the coal, and the materials quickly solidify from the 
molten state while suspended in the flue gas. The color 
of fly ash can vary from tan to gray to black, depending 
on the amount of unburned carbon in the fly ash. The 
darker colors generally indicate greater carbon content. 
Lignite or subbituminous coal fly ash is typically low in 
unburned carbon (LOI) content and high in lime (CaO) 
content (Table 2) and is a light tan to buff color. Most 
bituminous coal fly ash is typically gray in color, with 
lighter shades of gray generally indicating a higher 
quality and lower unburned carbon content fly ash. 

Fly ash contains a mixture of crystalline and amor¬ 
phous materials. Figure 3 shows a typical X-ray spectro¬ 
gram of a dry and a ponded fly ash sample. The crystalline 
components include a-quartz (Si0 2 ) that escaped melt¬ 
ing and minerals such as mullite (3A1 2 0 3 • 2Si0 2 ), 
hematite (Fe 2 0 3 ), and magnetite (Fe 3 0 4 ) that formed 
at high temperature during coal combustion. 

Figure 4 shows SEM images, indicating spherical 
particles, of a bituminous coal fly ash sample at 3,000 x 
magnification (A) and at 6,000x magnification (B). 
Some fly ashes contain a small proportion of particles 
that are hollow spheres filled with air/gas that will float 
on the surface of fly ash ponds. These particles, known 
as “cenospheres,” are collected and used in some spe¬ 
cialized applications (see section “Applications”). 

ASTM (American Society for Testing and Mate¬ 
rials) C 618 provides specifications for fly ash and raw 
or calcined natural pozzolan for use as mineral admix¬ 
tures in Portland cement concrete [18]. It also specifies 
two classes for fly ash, F and C, on the basis of their 
chemical composition, which mostly results from 
burning different types of coal. As shown in Table 2, 
the chemical composition of fly ash is generally 
reported in terms of oxides that include silica (Si0 2 ), 
alumina (A1 2 0 3 ), and the oxides of calcium (CaO), 
iron (Fe 2 0 3 ), magnesium (MgO), sulfur (S0 3 ), sodium 
(NaO), and potassium (K 2 0). Class F fly ash consists of 
at least 70% of combined Si0 2 , A1 2 0 3 , and Fe 2 0 3 , 
whereas, Class C fly ash must have at least 50% of 
combined Si0 2 , A1 2 0 3 , and Fe 2 0 3 . 
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Fly Ash. Table 2 Chemical compositions (wt%) of typical fly ash from burning coal of different ranks and the require¬ 
ments of the ASTM C 618 classification 



Bituminous 3 

Subbituminous b 

Lignite 0 

ASTM C 618 specifications 

Compound 

Class F 

Class C 

Class C 

Class F 

Class C 

Silica (Si0 2 ) 

57.0 

34.6 

28.9 



Alumina (Al 2 0 3 ) 

25.3 

17.8 

11.1 



Ferric oxide (Fe 2 0 3 ) 

5.3 

6.1 

11.1 



Si0 2 +Al 2 0 3 + Fe 2 0 3 

87.6 

58.5 

54.1 

70.0 min 

50.0 min 

Lime (CaO) 

2.0 

27.2 

18.9 



Magnesia (MgO) 

1.2 

5.8 

6.0 



Sodium oxide (Na 2 0) 

0.9 

2.3 

8.0 

1.5 max 

1.5 max 

Potassium oxide (K 2 0) 

3.1 

0.5 

0.9 



Sulfur trioxide (S0 3 ) 

0.2 

2.8 

6.8 

5.0 max 

5.0 max 

Loss on ignition (LOI) 

3.1 

0.9 

0.4 

6.0 max 

6.0 max 

Moisture content (%) 

NV 

NV 

NV 

3.0 max 

3.0 max 


NV not available 
Illinois Basin coal fly ash [19] 
b Powder River Basin coal fly ash [19] 
c North Dakota lignite coal fly ash [20] 



Fly Ash. Figure 3 

X-ray spectrogram of a dry fly ash sample and a ponded fly ash sample showing peaks for mullite (M), quartz (Q), hematite 
(H), and magnetite (Mg). The broad "hump" in the background between about 13° and 30° is due to abundant 
noncrystalline glass in the samples (printed with permission of the Illinois State Geological Survey) [17] 
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Fly Ash. Figure 4 

SEM images of a bituminous coal fly ash sample at 3,000x magnification (a) and 6,000x magnification (b). Photos courtesy 
of the Illinois State Geological Survey 


It should be noted that the LOI value correlates very 
well with the unburned carbon content, but it is not an 
actual measurement of the unburned carbon content in 
the sample. Burning lignite and subbituminous coals 
generally produces Class C fly ash that contains more 
calcium and magnesium and less unburned carbon 
(LOI) than Class F fly ash, which commonly results 
from burning bituminous and anthracite coal. 
Class F fly ash is typically pozzolanic in nature, but 
not cementitious by itself. Pozzolanic material contains 
glassy silica and alumina that in itself has little or no 
cementitious value, but in finely divided form and in 
the presence of water and free lime, will chemically 
react with the calcium hydroxide in the lime at ordi¬ 
nary temperatures to produce calcium aluminosilicate 
hydrates (cementitious compounds). On the other 
hand, typical Class C fly ash, in addition to having 
pozzolanic properties, has some cementitious proper¬ 
ties, particularly the ability to increase concrete com¬ 
pressive strength in the presence of water alone, 
because of its higher lime (CaO) content. ASTM 
C 618 also sets the maximum acceptable LOI value 
(indicating unburned carbon content) at 6% for both 
Class F and Class C ash in concrete applications 
(Table 2) (see section “Concrete (Conventional) and 
Cement” for more discussion). 

For use in Portland cement concrete, fly ash must 
meet the chemical (Table 2) and physical (Table 3) 
requirements specified by ASTM C 618. ASTM 


specification C 311 provides standardized test methods 
for sampling and testing fly ash or natural pozzolans 
for use in Portland cement concrete [21]. 

Specifications have also been established in other 
countries or by industrial sectors [22-24]. In Canada, 
for example, specifications for the use of fly ash in 
concrete are given in CSA A3001 [25], which classifies 
fly ash further as Types F, Cl, and CH based on the 
calcium oxide content. Type F refers to fly ash with 
a CaO content less than 8%, Type Cl has a CaO content 
ranging from 8 to 20%, and Type CH fly ash has a CaO 
content greater than 20%. Higher CaO content generally 
indicates that the fly ash has higher cementitious prop¬ 
erties. Two types of fly ash are produced and used 
in Canada, Type F and Type CI. Type CH fly ash is 
imported from the USA and used in concrete applica¬ 
tions in Canada [26]. It is important to recognize 
that not all fly ashes meet the ASTM C 618 specifica¬ 
tions. For applications other than concrete, fly ash 
may not need to meet ASTM C 618 requirements. 
Specifications of fly ash for other applications have 
also been developed, such as ASTM D 5239 for soil 
stabilization [27] and ASTM E 2277 for structural fill 
applications [28]. 

Applications 

Fly ash, because of its mineralogical composition, fine 
particle size, and amorphous character, is commonly 
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Fly Ash. Table 3 Physical requirements of the ASTM C 618 specification [18] 


Physical requirements 

ASTM C 618 specifications 

Class F Class C 

Amount retained when wet sieved on 45 pm sieve, max (%) 

34 

34 

Pozzolanic activity index, with Portland cement at 28 days, min (%) of control 

75 

75 

Pozzolanic activity index, with lime at 7 days, min (MPa) 

5.5 

- 

Water requirement, max (%) of control 

105 

105 

Autoclave expansion or contraction, max (%) 

0.8 

0.8 

Specific gravity, max variation from average (%) 

5 

5 

Percentage retained on 45 pm sieve, max variation,% points from average 

5 

5 


pozzolanic and in some cases is also cementitious, 
whereas, bottom ash is much coarser and is not pozzo¬ 
lanic in nature. Therefore, it is important to distinguish 
fly ash from bottom ash. In addition, fly ash produced 
by different power plants commonly has different levels 
of pozzolanic and cementitious properties; therefore, 
fly ash must be analyzed and assigned for use in specific 
applications according to their analysis. 

Fly ash has many established applications - mainly 
as an ingredient in concrete/cement products (see section 
“Concrete (Conventional) and Cement”), in stabilizing 
structural fills and embankments [29, 30], and in 
road bases and subbases [31]. Other uses are as 
a supplementary cementitious material in grout [32], 
soil stabilization [27, 33], and inclusion in flowable 
fill [34]. 

Fly ash also is used as mineral filler in asphalt 
concrete [35], roller compacted concrete [36-38], 
cured blocks, and fired brick products (sections 
“Autoclaved Aerated Concrete (AAC)” and “Ceramic 
Brick”), and also in the production of rubber [39, 40], 
paint [41, 42], and geopolymers [43-45]. It also has 
been found useful in agriculture [46-50] and in liquid 
and solid waste management [51, 52]. 

Another use for fly ash is for the extraction and 
recovery of high-value metals [53-58] that are minor 
or trace components of fly ash, such as magnetic iron 
(Fe), Aluminum (Al), Vanadium (Va), Strontium (Sr), 
and rare earth metals, such as Gallium (Ga) and 
Germanium (Ge). 

Fly ash generated from some pulverized-coal-fired 
power plants contains particles of cenospheres 


(gas-filled glassy spheres that float in water), which 
can be easily collected from the surface of ash ponds, 
or separated from dry fly ash using density separation 
device. Cenospheres have many unique properties 
including their light weight and hollowness, and almost 
perfectly spherical shape. They also have high strength 
compared to ordinary glass, and chemical resistance 
that enable them to be used in a wide range of com¬ 
mercial and industrial applications. They are especially 
useful as fillers in materials where their high strength, 
thermal insulating properties, and resistance to high 
temperatures are important. For more characteristics, 
advantages, and applications of cenospheres contact 
Fillite and Sphere Services, Inc. [59]. 

Fly ash applications by category and subcategory in 
various countries from most recent available sources 
[2, 7, 60-62] are tabulated in Table 4. (Information 
about such use categorization in China was not 
found.) The data indicate that the main use of fly ash 
falls into two categories: (1) concrete/cement products 
and (2) structural fill, embankments, and road base. 
In all cases, the use of fly ash as a component in 
concrete/cement products exceeds any other single 
application. The proportion of recycled fly ash used 
in concrete/cement products was 49.0% in India, 
52.3% in the USA, 74.5% in the European Union coun¬ 
tries, 85.4% in Canada, and 87.0% in Australia. 

The governments of major fly ash-producing coun¬ 
tries continue to promote use of coal fly ash and other 
CCPs through mandatory or incentive programs and 
research funding. Researchers continue to develop new 
and improved applications that allow fly ash to be used 

















Fly Ash. Table 4 Fly ash production and application by category and subcategory in various countries 
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in greater quantities and in more valuable ways. A brief 
overview of the most often used and potentially high- 
volume established construction product applications 
of fly ash are described below. 

Concrete (Conventional) and Cement 

Historical background - Ash erupted from nearby vol¬ 
canoes such as Mounts Etna and Vesuvius was used 
extensively in the construction of Roman structures. 
The Pantheon, which was built 2,000 years ago and is 
still standing today, is a classic example of durability 
achieved by using volcanic ash (see photo at http:// 
www.acaa-usa.org). Fly ash, which was recognized as 
possessing pozzolanic properties similar to those of the 
volcanic ash that made ancient concrete structures 
durable, was used as an ingredient in concrete as early 
as 1914 [63]. However, the first comprehensive study of 
fly ash in concrete was described by R. E. Davis and 
others in 1937 in the USA [64]. Davis and his 
coworkers [64-66] studied the effects of the fineness, 
loss on ignition (LOI), and moisture content of fly ash 
on the weathering resistance and other properties of 
concrete and Portland cement. Their work provided 
a foundation that continues to be developed worldwide 
into modern specifications and quality assessments of 
fly ash-based products. 

For example, the first British standard for the use of 
fly ash in concrete was established in 1965 and the first 
edition of standard BS 3892 [67] was as a fine aggregate 
(inert filler) and not as supplementary cementing 
material. During the time that research was underway, 
several comprehensive research projects were being 
conducted by the UK power industry to examine how 
base-load pulverized coal combustion conditions and 
fly ash collection systems influenced the physical and 
chemical properties of the fly ash produced and the 
results of those studies were also published in 1965 
[68]. Standard BS 3892 was revised to include fly ash 
use as a cementitious component in structural concrete 
in 1982 [69]. Part I required fly ash to have a fineness 
limit of 12.0% by mass retained on a 45 pm sieve and 
a LOI of less than 7.0%. This standard was further 
revised in 1997 with the same general property require¬ 
ments, but the particle size range was expanded to 
allow up to 40% of the mass to be retained on 
a 45 pm sieve [70]. The European fly ash for concrete 


standard, BS EN 450, was established in 1994-1995 
[71] and this standard was further revised into BS EN 
450 -1 [72] and BS EN 450 -2 [73] to include defini¬ 
tions, specifications and conformity criteria, and eval¬ 
uation, with some changes in fly ash categorization. 

The construction of the Hungry Horse Dam in the 
USA completed in 1953, which used about 120,000 
metric tons (132,240 short tons) of fly ash in the 
more than 3 million cubic yards of concrete to build 
dam, was the first major large-scale project that paved 
the way for using fly ash in the concrete industry [ 74, 75 ]. 
At least 100 other major locks and dams have been 
constructed using fly ash-containing concrete under 
the direction of the US Army Corps of Engineers, the 
US Bureau of Reclamation, or private engineering 
firms. Some of the most prestigious construction pro¬ 
jects of recent times have relied on concrete containing 
fly ash, including dams, power stations, offshore plat¬ 
forms, tunnel, walls, highways, airport runways, com¬ 
mercial and residential buildings, bridges, pipelines, 
and silos. For worldwide examples and photos of 
some outstanding applications of fly ash in the con¬ 
struction industry see the ECOBAWeb site [76] and the 
ACAAWeb site [2]. 

Benefits - The unique benefit of using fly ash in 
concrete is that fly ash, like volcanic ash, is capable of 
reacting with calcium hydroxide (free lime) at ordinary 
temperature. When fly ash is used in combination with 
Portland cement, calcium hydroxide librated from the 
hydration of Portland cement reacts with the alumino¬ 
silicates present in the fly ash to form calcium alumi¬ 
nosilicate hydrates (pozzolanic reaction products) 
possessing cementitious properties. This not only 
reduces the excess amount of calcium hydroxide 
(a nondurable by-product from hydration of Portland 
cement), but, in the pozzolanic reaction process, con¬ 
verts the calcium hydroxide into useful calcium alumi¬ 
nosilicate hydrates, which are the strongest and most 
durable portion of the paste in concrete. Also, pozzo¬ 
lanic reactions are much slower than cement hydration 
reaction thus the strength of concrete products is 
improving continuously. 

Fly ash not only has been successfully used world¬ 
wide as a mineral admixture in Portland cement con¬ 
crete for more than half of the past century, but also has 
been well established as a cement raw material and as 
a component of blended cement. 
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Fly ash used in concrete has been shown to have the 
following benefits: (1) decreases concrete’s permeabil¬ 
ity, thus reducing the rate of penetration and erosion by 
water and resisting harmful alkali-aggregate and sulfate 
reactions; (2) increases the concrete mix’s cohesiveness, 
thus reducing segregation and bleeding in the concrete; 
(3) contributes to long-term strength improvement in 
concrete; (4) improves the workability of fresh con¬ 
crete, enabling more thorough compaction, and 
(5) reduces and delays the onset of heat of hydration 
in massive concrete structures such as dams and con¬ 
sequently the risks of thermal cracking of the concrete 
[77-80]. 

Overall, the greatest benefits in using fly ash in 
concrete/cement applications are environmental. Fly 
ash is itself a waste by-product material from coal- 
fired power plants. In the past, the impoundment and 
land disposal of raw fly ash has covered millions of 
acres of valuable land in the major fly ash-producing 
countries. Utilization of fly ash in valued products will 
reduce this landfill. Furthermore, when 1 ton of cement 
is manufactured, 0.85 t of C0 2 (a green house gas) are 
also produced and released into the atmosphere [81]. 
Using fly ash as a cement raw material or as 
a component of blended cement can reduce cement 
production, and thus reduce green house gas 
emissions. 

Fly ash properties in concrete applications - Fly ash 
properties may vary depending on the type of coal used 
and its chemical compositions and the burning condi¬ 
tions at a specific power plant. Class F fly ash, which 
contains low lime (CaO) content, is typically a result of 
burning anthracite or bituminous coal, whereas, high- 
lime Class C fly ash is generally from burning lignite or 
subbituminous coal. Fineness, loss on ignition (residue 
unburned carbon content), and chemical composition 
are the key elements of fly ash affecting its concrete 
applications. Fine fly ash generally gives resulting con¬ 
crete high in compressive strength [82, 83]. In many 
concrete formulations, air bubbles are entrained into 
the mixture through mechanical agitation and the air 
bubbles are stabilized by an addition of specialized 
surfactants known as air-entraining admixtures 
(AEAs). The presence of small air bubbles increases 
the workability and cohesion of the concrete mix and 
improves the resistance of the set material to freeze- 
thaw cycles. The proper dosage of AEA is critical in 


generating the desired amount of entrained air and 
stabilizing the air bubbles during mixing and transpor¬ 
tation to the construction site [84] . Since the unburned 
carbon can adsorb the AEAs that are added to the 
mixture, the loss on ignition value of fly ash is restricted 
to no more than 6%. In terms of chemical composition, 
the fly ash used in concrete applications normally con¬ 
tains less than 10% of lime (CaO). Excessive lime in fly 
ash lowers the pozzolanic reactivity and affects the 
volume stability of the concrete. As mentioned in the 
section “Fly Ash Properties and Classification,” fly ash 
used in Portland cement concrete applications must 
meet the chemical and physical requirements of 
ASTM C 618 or the specifications of other jurisdic¬ 
tions. Also, it must be in a dry form and its pozzolanic 
reactivity and quality consistency must be monitored 
closely. Those who want detailed information on fly ash 
for concrete and cement products can find references 
from Headwaters Resources [85], Turner-Fairbank 
Highway Research Center’s coal fly ash user guide- 
Portland cement concrete [86], and from the book Fly 
Ash in Concrete by Gordon and Breach Science Pub¬ 
lishers [87] . Other publications on the same subject can 
be obtained from the organizations listed in “Useful 
Resources” at the end of this entry. 

Autoclaved Aerated Concrete (AAC) 

Autoclaved aerated concrete (AAC), also known as 
autoclaved cellular concrete and porous concrete, is 
a lightweight building material that is made without 
any coarse aggregate. The manufacturing process of 
AAC is shown in Fig. 5. 

AAC is produced from a mixture of a silica-rich 
material (such as finely ground sand or fly ash as the 
major ingredient), lime and/or cement (as a binding 
agent), water, and very small amount of aluminum 
powder (as an expansion agent). The mixture is poured 
into large molds and allowed to expand as bread 
dough, forming a highly porous, but impermeable 
cellular structure (see Fig. 6). The cake (green aerated 
concrete) is separated from the molds and cut into 
standard size and shape for curing in a pressurized 
autoclave chamber at a temperature about 375° F. The 
ingredients combine during steam curing to form cal¬ 
cium silicate hydrates (the minerals typically formed 
from Portland cement), which give AAC its strength. 
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Manufacturing Process 
Note! Follow steps 1-11, then steps A-E 
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Fly Ash. Figure 5 

Typical manufacturing process for producing autoclaved aerated concrete (AAC) (printed with permission of OSA Inc. [88]) 



Fly Ash. Figure 6 

The highly porous structure of two AAC blocks produced at the Illinois State Geological Survey containing 64 wt% fly ash 
obtained from two different power plants (Photos courtesy of the Illinois State Geological Survey [90]) 


AAC has been produced in many countries for more 
than 70 years because it offers considerable advantages 
over other construction materials [89]. AAC is only 
one-fourth to one-third the weight of conventional 


concrete, has excellent thermal and sound insulation 
properties, is nonflammable, and insect resistant. 

Sand-based AAC was developed during the 1920s 
(patented 1924), when Sweden was enduring an 













































Fly Ash 


3833 



extreme shortage of wood due to deforestation. In need 
of an alternative building material, Swedish architect 
Johan Eriksson developed AAC to meet the growing 
demand for a product that had wood’s workability 
coupled with excellent thermal insulation, structural 
strength, and superior fire resistance. To further 
improve AAC, Hebei invented and patented a method 
of incorporating steel reinforcements during the 
manufacturing process in 1945 [89]. Since commercial 
production of the AAC material began in 1930, AAC 
has become one of the most commonly used building 
materials in Europe and its use is rapidly growing in 
many other countries. Fly ash can be used to substitute 
for the sand in the mixture, typically at 55-65% by dry 
weight. During the last decade, fly ash has become 
a widely used replacement for ground sand for making 
AAC in many countries. 

In 1992, the UK used about 800,000 t of fly ash per 
year in the manufacture of approximately 1.5 million 
cubic meters of AAC, produced in ten factories [91]. 
In Czechoslovakia, six plants produced 2 million cubic 
meters of AAC, and 70% of this output was produced 
with fly ash [92]. The use of AAC has grown in recent 
years because of AAC’s excellent thermal insulating 
properties and other economic and environmental 
advantages. In some countries, the demand for AAC 
products has exceeded its rate of production [93]. 
Currently, there are over 300 AAC manufacturing 
plants on six continents [94]. However, sand-based 
AAC production in the USA was not initiated until 
1996 when ASTM commercial standards for AAC 
were established. There are currently three ASTM stan¬ 
dards available for AAC - ASTM Cl386 for material 
properties, ASTM Cl452 for reinforced AAC panels, 
and ASTM C1555 for AAC masonry units [95]. 

In the 1990s, several studies were conducted in an 
attempt to establish fly ash-based AAC plants in the 
USA. A collaborative study led by the Electric Power 
Research Institute (EPRI) was begun in 1993. The pro¬ 
ject constructed an on-site AAC mobile pilot plant and 
tested various coal ashes for making AAC at nine utility 
plants of seven utility companies, including Tennessee 
Valley Authority (TVA) and Ohio Edison Co. [96]. 
Another study was initiated in 1997 by Babb Interna¬ 
tional who built an AAC pilot plant in Ringgold, 
Georgia, [97] for the purpose of conducting test runs 
for making AAC using coal fly ash. Subsequently 


during 1998-2000, Babb International built a commer¬ 
cial AAC plant in Indiana for producing fly ash-based 
AAC. However, both plants were closed in 2003, due in 
part to the slow acceptance for AAC in the USA [88]. 

From 1997 to 2005, six sand-based AAC plants were 
constructed in the USA but, as of 2009, only three of the 
six remain in limited operation serving a small AAC 
market. Extensive commercial use of AAC in the USA 
has not yet been achieved due in part to the low cost of 
lumber and the resistance of the home and commercial 
construction industry to switching raw materials and 
changing construction methods. However, this resistance 
may soon be overcome when architects, builders, and 
the public become more aware of the many benefits of 
using energy-efficient AAC materials. In 2005, 
OSA Inc. acquired the Ringgold AAC plant and, in col¬ 
laboration with the Illinois State Geological Survey, devel¬ 
oped an economical manufacturing method (2009 patent 
pending) for fly ash-based AAC production [98]. OSA 
Inc. is currently working with a local government in 
producing enough ash-based AAC materials for mar¬ 
keting analysis. 

Ceramic Brick 

The term “bricks” typically refers to ceramic (fired) 
bricks, the standard small building units in 
a rectangular shape, but the term has also been used 
to refer to unfired small building units in a rectangular 
shape, such as small concrete blocks, or (aerated or 
non-aerated) sand-lime bricks [99]. This section 
mainly considers the use of fly ash in making standard 
ceramic (fired) bricks. 

In the process of manufacturing fired bricks, there 
are two critical steps - forming firmed green bricks and 
firing them successfully. Clay and shale are the standard 
raw materials used in making conventional fired bricks. 
In general, green bricks are formed by two basic 
methods - mold-pressing (dry or wet) or extrusion 
(under vacuum) - and are then dried. Firing of the 
dried green bricks is done in a kiln or oven with proper 
temperature control for successful firing. In a typical 
firing cycle of progressively increasing temperature, the 
temperature rises fairly rapidly from ambient condi¬ 
tion to a temperature of about 1,000°F and then 
at a slower rate up to about 1,500°F until the carbona¬ 
ceous matter in clay/shale is oxidized completely [100]. 
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Insufficient oxidation of the carbonaceous matter in 
the clay/shale mixture results in bricks produced with 
black or red cores. Also, improper rate of heating would 
result in defective bricks with cracks or bloating 
beneath the surface skin, which are commercially unac¬ 
ceptable. Iron oxide in the raw material is the primary 
cause of the typical redbrick color. 

Concrete blocks are manufactured in a much wider 
range of shapes and sizes than clay bricks and are 
available with a wider range of treatments. A number 
of the treatments are used to provide a final size, color, 
and shape similar to that of fired clay bricks. These 
small concrete building blocks in a rectangular shape 
also are referred to as bricks. In this case, the “green” 
concrete bricks are formed in steel molds by vibration 
or by compaction with a hydraulic press. When dry, the 
green bricks rather than being fired are cured under 
low-pressure steam at ambient temperature to gain 
strength/hardness and other properties. 

Fired brick manufacturing operations in most part 
of the world are labor- and energy-intensive. Brick 
manufacturers in developing countries, having cheaper 
labor, still rely heavily on mold-pressing of green bricks 
manually or by machine. In comparison, manufac¬ 
turers in developed countries mainly utilize machine 
extrusion to form green bricks and tunnel kiln firing. 
A typical extruder machine produces about 8,000 green 
bricks per hour, which would take a team of two skilled 
brick makers 10 days to produce using mold-pressing 
method [101]. In 1998 in the USA, according to 
a survey conducted by the Brick Industry Association, 
93.2% of total regular clay brick production was 
formed by extrusion. A machine-molded method was 
used for making specialty bricks that constituted 5.3% 
of the total production, and only 1.5% of all the 
manufactured bricks were produced by the dry¬ 
pressing method commonly used in developing coun¬ 
tries [102]. Using the extrusion method, as much as 
70 wt% of fly ash can be used to substitute for clay or 
shale; nearly 100% by weight can be substituted when 
using a mold-pressed method. 

Fly ash and bottom ash were used in making 
ceramic (fired) products such as building bricks, 
pipes, and roof tiles as early as the 1930s, but the 
practices used and any valuable technical information 
gathered, in most cases, were not documented. These 
early trials generally took place as a result of informal 


initiatives between a brick company and a local power 
plant and they had little incentive to preserve or pub¬ 
lish results. 

A brief historical review of making fired brick using 
fly ash was first conducted in the UK by Anderson and 
Jackson of the North Staffordshire Polytechnic in 1987 
[103]. Their review indicated that despite successful 
trials and technical promotion during the late 1950s- 
1960s, the response by the brick industry to use fly ash 
in brick production was poor due in part to: (1) the 
high demand for fired bricks at that time, (2) the low 
cost of conventional raw materials, (3) the difficulties 
encountered in some field trials when making bricks 
containing fly ash, and (4) reservations about the con¬ 
sistency in the quality of the fly ash supply [103]. 

In 1977, Slonaker experimented with making fired 
bricks comprised of mainly of Class F fly ash generated 
from the West Virginia and Pennsylvania coals. He used 
a dry mold-pressed method to form green bricks, and 
acceptable bricks were produced from feeds consisting 
of as much as 72% fly ash, 25% bottom ash, and 3% 
sodium silicate [104]. However, Slonaker’s study did 
not lead to commercialization. His fly ash bricks did 
not have conventional redbrick color, and his bricks 
with a composition of nearly 100% ash probably lacked 
sufficient plasticity to be extruded well. 

In 2005, Freight Pipeline Company (FPC) studied 
a mold-pressed method for making green bricks com¬ 
posed of almost 100% Class C fly ash. The brick 
strength, rather than firing, was developed by water- 
steam curing at room temperature [105]. Commer¬ 
cially available red-colored pigment was used as an 
additive to give the final brick products the traditional 
redbrick color. Recently, FPC issued its US patent 
license to CalStar Cement, a California-based com¬ 
pany, for further development. 

During 2000, researchers at the Illinois State 
Geological Survey, University of Illinois, began to 
experiment with using local Class F fly ash for 
manufacturing fired bricks. Fly ash was obtained as 
dry fly ash from hoppers or as ponded fly ash from 
a power plant pond or a stockpile. A systematic labo¬ 
ratory study on extruded fired bricks containing up to 
70 wt% dry fly ash showed that the fired bricks had 
greater compressive strength (Fig. 7) [106] and lower 
thermal conductivity [107] than conventional bricks 
containing no fly ash. 
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Compressive strength of fired bricks made with increasing amounts of fly ash in clay brick formulation. MFA, Plant-1 dry fly 
ash; EFA, Plant-2 dry fly ash. Figure courtesy of the Illinois State Geological Survey [106] 



In further studies with industry partners, pilot- 
scale and commercial-scale tests, formulations, and 
procedures for producing commercial-size building 
and paving bricks containing amounts of fly ash 
were optimized to suit the plant’s manufacturing 
processes. A handful of minor issues encountered in 
pilot-scale trials were resolved and commercial-scale 
production runs were successfully completed at three 
local brick plants [ 108-1 10] . Figure 8 shows a photo of 
an extrusion test run producing three-hole green fly ash 
building bricks at 8,000 bricks per hour (700 bricks per 
car). Figure 9 shows four batches of fired bricks suc¬ 
cessfully produced with fly ash inputs at 0% (El), 20% 
(E2), 30% (E3), and 40% (E4) by volume. These stud¬ 
ies, supported by state and federal governments, 
showed that Class F fly ash from Illinois Basin bitu¬ 
minous coals is suitable for fired brick production, 
especially when wet fly ash from retaining ponds was 
used. This is because most of the scum is associated 
with soluble salts in the dry fly ash and has been 
leached out and removed in the pond water. Thus, 
the amount of scum-suppressing additive otherwise 
required can be reduced or completely avoided. 


Standard commercial-size fired bricks produced with 
clay/shale mixtures containing 30-40% by volume 
(28-37% by weight) of ponded fly ash showed color 
similar to conventional fired bricks and their engi¬ 
neering properties, including results of freeze-thaw 
tests, met or exceeded the ASTM C 62 commercial 
specifications [111]. Also, local builders acknowledged 
that their method for conventional brick construc¬ 
tion can be applied directly to the construction of fly 
ash bricks. 

A knee-high wall at the Museum of Science and 
Industry in Chicago (Fig. 10) was constructed using 
Illinois fly ash bricks produced from commercial-scale 
production tests. This was part of the Museum’s Clean 
Coal Technology Exhibition. The economic evaluation 
of the commercial-scale tests was optimistic [108]. An 
environmental assessment of fired bricks indicated that 
the fly ash-containing bricks were fully comparable to 
regular commercial bricks as environmentally safe con¬ 
struction materials [112]. 

Participating brick companies are seeking partners 
for a joint venture to commercially produce bricks 
containing fly ash. Only a few brick plants in the 
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Fly Ash. Figure 8 

Scale-up extrusion demonstration at a brick plant producing three-hole green fly ash building bricks at 8,000 bricks per 
hour (700 bricks per car) (photo courtesy of the Illinois State Geological Survey [110]) 



Fly Ash. Figure 9 

Four batches of fired building bricks produced from scale-up production test runs with fly ash inputs at 0% (El), 20% (E2), 
30% (E3), and 40% (E4) by volume (photo courtesy of the Illinois State Geological Survey [110]) 


USA are currently producing bricks containing small 
amounts (1-5%) of coal ash. In a sustainable world, 
this rate can be increased manyfold. 

Environmental Impacts and Considerations 

As mentioned earlier, use of fly ash as a recycled mate¬ 
rial can have economically and environmentally bene¬ 
ficial results. Use of fly ash as one of the ingredients in 
concrete to improve durability and service life is 


important not only to private owners, but also to 
governmental authorities responsible for the design, 
construction, maintenance, and repair of buildings, 
bridges, roads, and infrastructure. The use of fly ash 
in concrete enhances the recycled material content of 
a building and is recognized as a beneficial strategy for 
C0 2 reduction. The use of fly ash in concrete-related 
products and bricks is an effective and increasingly 
used environmentally responsible strategy to promote 
sustainability because it (1) eliminates the need to 
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Fly Ash. Figure 10 

A knee-high wall at the Chicago's Museum of Science and 
Industry constructed with fly ash bricks produced from 
commercial-scale production tests. This was part of the 
museum's Clean Coal Technology Exhibition (photo 
courtesy of the Illinois State Geological Survey) 

dispose of fly ash, a typically industrial waste material; 

(2) reduces the cement content of concrete, and thus 
the C0 2 generated from cement manufacturing; 

(3) reduces the amount of embodied energy in the 
concrete; and (4) conserves natural resources by 
replacing the materials that would otherwise have to 
be mined to make these products. Also, the trace ele¬ 
ments in the fly ash are stabilized in the concrete 
products, reducing the chances for releasing them 
into the environment [113]. A review on various 
reports and research papers in the UK regarding the 
environmental impacts of using fly ash was conducted 
by Sear and coworkers [114] in 2003. Their conclusion 
was that coal fly ash is an environmentally harmless 
material that can be safely used in bound (such as 
concrete products) and unbound (such as land recla¬ 
mation) applications. 

Potential health issues of using coal ash were also 
discussed in the Environmental Focus Report issued by 
the Electric Power Research Institute in 1998. The sub¬ 
jects included ash dust issue, leaching of coal ash from 
land applications and from products containing fly ash 
as well as skin contact and radioactivity of coal ash. It 
concluded that when handled and disposed properly, 
coal ash does not present a public health or environ¬ 
mental threat [115]. 

Having advantages in physical, chemical, and min- 
eralogical properties, many fly ashes are marketed for 


a wide range of engineering applications in construc¬ 
tion and other industries. However, large volume of 
these materials is still remaining for disposal every year 
in the USA and the other major fly ash-producing 
countries. The environmental disposal practices of 
these materials are important issues impacting pro¬ 
ducers and regulators. 

Physically, coal fly ash is a fine powdery material, 
and in its dry form, just like road dust, it readily 
becomes airborne even in a mild wind. Initially, coal- 
fired power plants released their fly ash together with 
flue gas into the atmosphere. The settlement of notice¬ 
able amounts of fly ash dust particles on the land had 
disturbed ecological systems and degraded agricultural 
land and soil quality. Health risks for utility workers 
from long-term exposure to fly ash dust, just like road 
dust, include diseases such as silicosis, fibrosis of the 
lungs, and bronchitis. Power plants eventually were 
mandated to use collection devices such as electrostatic 
precipitation systems to gather fly ash and keep it from 
being carried with exhaust gases out the stack. The 
general public does not now encounter coal ash in 
quantities that might result in health risks. 

During power plant operation, some plant workers 
may be exposed to appreciable amounts of fly ash, either 
as suspended particulates or in collected ash piles. Such 
workers include primarily ash haulers, ash silo operators, 
truck loaders and drivers, ash system inspectors, and ash 
collection system personnel. The main sources of ash 
dust are the ash transfer system and ash loading. Under 
these circumstances, the precaution measures issued by 
the health organizations similar to the US Occupational 
Safety and Health Administration must be followed. 
However, studies on potential health effects of workers 
in frequent contact with coal fly ash showed that routine 
operating activities did not produce hazardous expo¬ 
sures, and occupational health records for these types 
of workers do not show a higher incidence of respiratory 
problems than those of power plant workers who do not 
work as closely with coal ash [115]. 

Dust control measures are also required at fly ash 
disposal sites to prevent dust hazard conditions and 
health risks. For example, transportation of fly ash to 
the disposal site typically is done in a conditioned form 
(moisture content about 15-30%) and in covered 
trucks or rail cars. At the disposal site, as the fly ash 
dries it forms a crust or clumps and is less easily moved 
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by the wind. In many cases, soil is applied on top of the 
fly ash as a cover material. These processes are designed 
to keep fly ash dust at a low enough level to avoid any 
expected health impacts. 

Chemically, the main components of coal fly ash as 
indicated in Table 2 are about 60-90% silica, alumina, 
and iron oxides. Other metals and metal-like elements 
commonly are present only in trace quantities - arsenic, 
cadmium, beryllium, thallium, nickel, lead, manga¬ 
nese, chromium, selenium, zinc, and other metals. 
The primary health concern from the land disposal 
of fly ash is the potential for the trace elements listed 
above to enter nearby groundwater. Standardized 
leaching tests show that small amounts of toxic trace 
elements (particularly arsenic, selenium, lead, and mer¬ 
cury) may leach from fly ash that is in prolonged 
contact with ground water, but they do not migrate 
far from the ash site and are present in small enough 
concentrations to justify the regulatory classification of 
fly ash as nonhazardous solid waste [115]. Studies on 
various animals examining the toxic effects of ingesting 
fly ash constituents have not found pathological dam¬ 
age that would indicate potential human health prob¬ 
lems [115]. The majority of fly ash is not enriched in 
radioactive elements, nor is it more radioactive, com¬ 
pared to common soils or rocks. Also, radiation in fly 
ash and in products produced from fly ash is minimal - 
well below the US Environmental Protection Agency’s 
action standards [115, 116]. However, each fly ash 
should be analyzed on a case-by-case basis for potential 
toxins and radioactivity because coals come from many 
areas. Also, precaution measures should be undertaken 
when it is required by local governments. 

Fly ash is not currently regulated in the USA by the 
US Environmental Protection Agency (EPA) nor in the 
other countries as a hazardous waste material; however, 
it must be handled or disposed of properly to prevent 
air pollution (irritant dusts) and groundwater contam¬ 
ination. For example, it is important to periodically 
monitor drinking water wells near fly ash disposal 
sites to ensure that the concentrations of trace elements 
remain below the regulatory levels for safe long-term 
drinking and cooking. 

Another precautionary example for ensuring proper 
disposal of fly ash is evident by the failure of a retaining 
pond structure on December 22, 2008 at the Kingston 
power plant of the TVA in the USA. This structural failure 


led to the release of approximately 5.4 million cubic yards 
of impounded fly ash onto surrounding land and into the 
adjacent Emory River. The cost for clean-up is estimated at 
more than $1 billion, and costs of legal actions and settle¬ 
ment of possible lawsuits may add to this. This incident 
directly affects citizens/animals living in close proximity to 
the plant and the TVA, but also indirectly impacts all coal 
burning utilities and other large coal users throughout the 
USA. As a result of this incident, the US EPA instructed all 
power plant operators and companies to conduct on-site 
assessments to determine the structural integrity of their 
ash storage ponds, and take whatever measures were 
needed to ensure the integrity. Eventually, the US EPA 
may issue more rigorous regulations or may eliminate 
ash pond disposal altogether [117]. 

Future Directions 

Although the use of nonfossil energy sources such as 
nuclear or wind is growing, coal will continue to be the 
major fuel for worldwide power generation for some 
time to come [118]. While significant progress has been 
made in terms of reusing fly ash combustion residues in 
the world, maintaining the quality consistency of a fly 
ash supply, which is extremely important in many 
already established applications, would require new 
strategies. This is because the utility industry starts 
using new improved coal combustion technologies 
and using more renewable energy sources, and endures 
more stringent emissions control regulations. Major 
developments that would alter fly ash properties and 
impact the use of fly ash in manufacturing products 
include, but are not limited to the following. 

• The use of mercury reduction and/or NOx reduc¬ 
tion technologies (activated-carbon mercury 
adsorption, low NOx burners, selective catalytic 
NOx reduction, or selective non-catalytic NOx 
reduction), would produce fly ash with unaccept¬ 
ably high levels of unburned carbon [119] and, in 
some cases, adsorbed mercury or ammonia. 

• Switching coal sources and/or changing boiler oper¬ 
ating conditions in an effort to reduce the cost for 
electricity generation would result in fly ash with 
inconsistent properties. 

• Co-firing coal with other fuels (such as natural gas, 
wood, or other biomass) or other combustible 
wastes (such as petroleum coke, scrap tires, 
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municipal solid waste) would produce fly ash with 
inconsistent properties. 

• The use of advanced clean coal technologies, such as 
gasification of coal or burning coal/petroleum coke 
mixed material, would produce residues, mainly 
a hard glassy slag, as well as some fly ash with 
compositions much different from conventional 
pulverized coal combustion fly ash. 

Future developments for fly ash applications will 
require improved fly ash management strategies that 
will ensure the quality consistency of fly ash for new 
and established applications. These may include 
(1) blending and mixing of ashes for quality consis¬ 
tency, (2) particle size control by classification, grind¬ 
ing, and agglomeration, (3) physical or chemical 
processing for removal of residual carbon or ammonia, 
and (4) separation processing, including flotation and 
electrostatic or magnetic separation [ 120] . Some fly ash 
marketing companies have technologies ready for fly 
ash processing. But when additional treatment is not 
practical from an economic standpoint, the coal-fired 
power plant will have no other option other than land¬ 
fill disposal of the fly ash they produce. Managing fly 
ash in an efficient, environmentally sound, and cost- 
effective manner will remain a continuing challenge for 
the major fly ash producers, marketers, and regulators 
worldwide. 
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To many in the sustainability community, fossil energy 
is an anathema. Continued use of fossil resources - oil, 
coal, and natural gas - poses threats to the environment 
through the emission of pollutants and greenhouse 
gases. The fact that they are a limited or exhaustible 
resource means that in the future we could either run 
out of them or their extraction will get progressively 
harder to a point that it takes more energy to extract 
them than would be derived from their use. Using fossil 
energy is clearly not sustainable, and the world has to 
look to renewable resources for long-term survival. 

This is an encyclopedia about the science and 
technology of sustainability. The word, sustainability 
shares its root with sustenance. Any discussion of 
sustainability must therefore include discussions of 
sustenance, and in the context of modern society, 
sustenance stems from the use of energy. We derive 
energy from a number of different sources. Annual 
global consumption of energy is currently on the 
order of 500 exajoules (EJ), about 85% of which 
comes from fossil resources. From 500 EJ/year, the 
global energy demand is expected to rise to somewhere 
between 1,000 and 1,500 EJ/year by the middle of this 
century. The drivers for this increased demand are 
already in place. Large segments of China, India, and 
Brazil are poised to increase their standard of living - 
and the concomitant energy demand - substantially. If 
the average energy consumption in the Asia-Pacific 
region were to reach the current global average, which 
is about half of what is consumed in Europe and Japan, 
the demand would increase by an amount equal to the 
total energy consumption in the North America region. 
The challenge of achieving a sustainable future is in 
being able to balance the energy requirements for the 
billions of people so they can lead healthy and produc¬ 
tive lives against the need to preserve the environment 
by not running into its limits in its ability to supply the 
resources or act as a sink for the waste [ 1 ]. The entries 
in this section examine the current status, assess the 


resource potential of the various fossil fuels, examine 
the technologies for using them, and review the envi¬ 
ronmental impact of their use. 

Petroleum and natural gas have similar origin and 
often occur together in geologic formations, and their 
global distribution and production is discussed 
together in the entry by McCabe (► Oil and Natural 
Gas: Global Resources). This entry makes clear the 
distinction between reserves and resources. Reserves 
represent only that fraction of the resource base that 
can be economically recovered using current technol¬ 
ogy. These are not fixed quantities as both technology 
and economics change over time. Global annual pro¬ 
duction (and consumption) of oil in 2010 was 31 
billion barrels and of natural gas was around 120 tril¬ 
lion cubic feet. In energy units, they correspond to 
180 EJ of oil and 120 EJ of natural gas [2]. The current 
reserves are estimated at 1,236 billion barrels of con¬ 
ventional oil (7,500 EJ) and 6,545 tcf of natural 
gas (6,500 EJ). The current reserves to production 
ratio (R/P) is about 40 for oil and about 55 for natural 
gas. The R/P ratio has often been mistakenly taken as 
the time to exhaustion, but new discoveries as well as 
advances in technology add to the reserves. In the 
case of oil, for example, the R/P ratio has stayed around 
40-50 years for more than 60 years even with the 
steadily increasing oil consumption. In addition, 
there are also unconventional accumulations of these 
hydrocarbon resources and extracting them requires 
development of new technologies. In the case of oil, 
the unconventional resources are oil sands, oil shale, 
and heavy oil. Unconventional resources of natural 
gas are coal bed methane, tight gas, shale gas, 
and gas hydrates. These unconventional resources are 
vast and have the potential of more than doubling our 
resource endowment. 

Exploration and production of oil from sedimen¬ 
tary deposits and oil sands is the subject of an entry by 
Speight (► Petroleum and Oil Sands Exploration and 
Production). The processes for recovering oil could be 
a simple matter of drilling into the formation with the 
oil flowing to the surface under its own pressure, or it 
may require injection of gases, fluids, and surfactants to 
coax it to flow. In extreme cases, it may even require 
underground combustion of a portion of the resource 
to release the oil. Speight describes the chemical and 
physical factors that govern the flow of oil and the 
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technology options currently available. In a different 
entry, Speight (► Petroleum Refining and Environ¬ 
mental Control and Environmental Effects) provides 
and account of the different processes such as distillation, 
catalytic cracking, hydrotreating, reforming, and 
deasphalting used in the refining of crude oil. This 
entry also deals with environmental effects of the 
gaseous, liquid, and solid effluents from these processes. 
Production of oil from shale is principally achieved by 
retorting of shale, or other thermal processes including in 
situ pyrolysis. Oja and Suuberg (► Oil Shale Processing, 
Chemistry and Technology) detail the chemistry and 
technology of these processes in the entry. 

Gas hydrates represent a particularly noteworthy 
resource and are reviewed in the entry by Pa til (► Alaska 
Gas Hydrate Research and Field Studies). They are 
naturally occurring ice-like substances in which meth¬ 
ane or other light gases are trapped in the cage struc¬ 
tures formed by water molecules. They are stable under 
certain conditions of temperature and pressure, and 
can be found at many places around the world at 
depths of several hundred meters below the seabed 
and in the permafrost in the polar region. Global esti¬ 
mates of gas hydrates are on the order of 500,000 tcf, 
about a hundred times the proved reserves of natural 
gas. Apart from being a potentially very important 
source for energy in the future, the gas hydrates are 
also important from the perspective of climate change. 
A general warming could release substantial amounts 
of methane, which - being a potent greenhouse gas - 
would reinforce any global warming. 

Oil remains the prized fuel. It has a high energy 
density and as a liquid it is well suited for transporta¬ 
tion. The transportation sector relies on oil for over 
90% of its energy needs (the remainder being mostly 
furnished by coal - via electricity) [3]. Given the 
importance of liquid fuels, there is considerable interest 
in converting coal and natural gas, the other hydrocar¬ 
bon resources, into oil. In a pair of entries, Araso 
and Smith (► Gas to Liquid Technologies and ► Coal 
to Liquids Technologies) provide an overview of the 
various processes for converting natural gas and coal to 
liquids. These reviews cover the basic chemistry and 
catalytic technologies behind the different approaches. 
The entry on ► Gas to Liquid Technologies deals with 
steam reforming of methane, autothermal reforming, 
and partial oxidation approaches for producing syngas, 


including strategies for managing the heat and 
mass transfer through the use of different reactor 
technologies. The entry next covers the conversion of 
syngas into liquids by Fischer-Tropsch (FT) synthesis. 
The entry on ► Coal to Liquids Technologies reviews 
the different approaches like pyrolysis, direct liquefac¬ 
tion, coprocessing with petroleum, and indirect lique¬ 
faction that first converts coal into syngas and then uses 
FT or other conversion processes to make liquid fuels. 

The internal combustion engines that power these 
vehicles covert only about 20-30% of the energy in the 
fuel to motive power. Increasing the efficiency of the 
engines used in transportation represents a significant 
opportunity to reduce future demand for oil, as well as 
reduce the carbon footprint of the cars, trucks, and 
planes. Jacobs (► Internal Combustion Engines, Devel¬ 
opments in) provides a detailed account of the ther¬ 
modynamics of the different kinds of engines and of 
the various technologies under development for 
improving the efficiency of the internal combustion 
engines. These include strategies such as engine 
downsizing, turbocharging, better controls, variable 
geometry engines, variable valve timing, homogeneous 
charge compression engines, and waste heat recovery. 

Hook (► Coal and Peat: Global Resources and 
Future Supply) reviews the global coal and peat 
resources, which are substantially larger than conven¬ 
tional oil and gas resources. Peat and coal are also 
distributed more evenly around the globe. The proved 
reserves of coal are estimated at between 800 billion 
and a trillion metric tons, representing roughly 
20,000 EJ. Most coal is derived from trees and ferns 
that grew some 250-300 million years ago during the 
carboniferous period. Aerobic bacteria generally 
decompose most plant matter into C0 2 , but when 
a tree fell into a swamp and was buried with limited 
exposure to oxygen, it could be partially preserved. 
That phenomenon occurs even today and is evidenced 
in the formation of peat (very young coal) in bogs. 
Unlike coal, peat has a low calorific content and its 
commercial use as an energy resource is limited. 
However, peat is important from global carbon cycle. 
Peat bogs store vast amounts of carbon, but the 
product of anaerobic decomposition, methane, 
contributes to greenhouse gases in the atmosphere 
such that under steady state conditions they comprise 
only a slight sink of carbon. The situation changes with 
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fires as peat bogs become major source of carbon in the 
atmosphere. Human activities, such as drainage of the 
area for agriculture, exacerbate the situation, as more 
peat gets exposed and the swamp is not wet-enough to 
squelch the fire. Estimates of the amounts of carbon 
released from peat are on the order of several billion 
tons, the same order of magnitude as from fuels used in 
transportation or power production. 

As coal resources are vast, they are likely to continue 
to contribute substantially to global energy mix. How¬ 
ever, their use also presents a number of challenges, 
beginning with mining, and through coal preparation 
and use. Mining operations result in excavation of bil¬ 
lions of tons of earth, and depending on the nature of the 
waste rock and how it is handled it can lead to contam¬ 
ination of air, water, and soils systems. The wide-ranging 
nature of environmental impact necessitates a commen- 
surately broad portfolio of technologies for the remedi¬ 
ation, restoration and reclamation. Chattopadhyay and 
Chattopadhyay (►Mining Industries and Their Sus¬ 
tainable Management) provide an overview of the envi¬ 
ronmental impact of coal mining, and mining processes 
in general, as well as review the range of the chemical 
and biochemical strategies to mitigate the impact. 

The run-of-mine coal is often laden with 
noncombustible minerals such as shale and clays that 
reduce its heating value. Cleaning and washing of coals 
removes many of these impurities and upgrades the 
coal into a marketable resource, improve the perfor¬ 
mance of power plants, and also reduce potential of 
harmful emissions and dust. Luttrell and Honaker 
(► Coal Preparation) review these cleaning and prepa¬ 
ration operations, and point out the importance of 
cheap preparation in extending the resource base. 

Electricity is one of the most useful forms of energy; 
its importance to the way we live cannot be over 
emphasized. It can be used to perform all kinds of 
useful functions and services that people desire, and 
at the point of use it produces no pollution. Electricity 
is a secondary source of energy, as it must be produced 
from primary sources such as coal, natural gas, oil, 
nuclear, hydro and, of late, increasingly from wind 
and solar. In 2010, the global production of energy 
was 21,325 trillion kWh, which is equivalent to 77 EJ. 
However, since 68% of electricity is derived from 
burning fossil fuels at an average efficiency of 38%, 
the total primary energy the world consumed as 


electricity amounts to 135 EJ, or roughly 27% of the 
total energy consumed. Electricity demand in the world 
is projected to rise at a rate greater than for total energy, 
and meeting that need for additional sources for 
producing electricity is critical human welfare. 

Several entries in this section are devoted to the 
production of electrical power. As the fuel with 
the lowest levelized cost of electricity, coal remains the 
dominant fuel for producing electricity, and furnishes 
over half of the electricity. It results in the emission of 
about 900 g C0 2 per kWh, highest of any fossil 
resource. The development of coal power in future 
is going to be constrained by the concerns of C0 2 
emissions, and Beer (► C0 2 Reduction and Coal- 
Based Electricity Generation) discusses the different 
technology options for increasing the efficiency of 
coal-fired power plants and reducing the C0 2 emis¬ 
sions. Ultra supercritical steam cycles and combined 
heat and power applications are relatively straightfor¬ 
ward to apply, and have the potential to reduce the 
largest tonnage of emissions. 

Of course, there are other pollutants also that are 
emitted when coal is used for power productions. Chief 
among them are oxides of sulfur and nitrogen and toxic 
metals such as mercury, selenium, and arsenic, as well 
as carcinogenic PAH-containing soot aerosols. Moyeda 
(►Pulverized Coal-Fired Boilers and Pollution 
Control) reviews the technologies for scrubbing the 
stack gas emissions. The deployment of scrubbers for 
the nitrogen and sulfur oxides following the Clean Air 
and Water Act in the 1980s greatly improved the quality 
of air and water systems. 

Natural gas is the other major fuel for producing 
electric power. When used in a gas-fired combined- 
cycle mode, it produces only 380 g C0 2 /kWh. Smith 
and Giilen (► Natural Gas Power) review the technol¬ 
ogy for power generation from natural gas. They dis¬ 
cuss the different sources of natural gas, provide a brief 
history followed by detailed thermodynamics of gas 
turbines, and modern power plants designs that incor¬ 
porate combined cycle for maximum overall efficiency. 
They also review emissions from NGCC plants, notably 
NO x , and strategies for minimizing them. The entry 
concludes with a discussion of power plant economics. 

Carbon capture and sequestration (CCS) as a strat¬ 
egy for stabilizing atmospheric C0 2 levels is receiving 
considerable attention. Friedman (► C0 2 Capture and 
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Sequestration) provides an overview of the state of 
technology and the different embodiments of CCS. 
These cover various strategies for first capturing the 
C0 2 , which may be performed pre- or post-combus¬ 
tion, or from the atmosphere. He discusses the energy 
requirements under these different scenarios. The entry 
also reviews the geochemistry of various options for 
sequestering the captured C0 2 , either in deep saline 
aquifers or in other geological formations. 
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Electrochemical capacitor An electrochemical capaci¬ 
tor (supercapacitor, ultracapacitor or double-layer 
capacitor) is an electrochemical device that can store 
and convert energy by charging/discharging the elec¬ 
trochemical double-layer of 2 electrodes with large 
surface areas and thus large double layer capacitances. 

Battery A battery or Voltaic cell consists of one or 
more electrochemical cells which store and convert 
chemical energy into electric energy. 

Ragone plot A Ragone Plot compares the perfor¬ 
mances of different energy storing devices by plot¬ 
ting power densities or specific power [W/kg] 
versus energy densities or specific energy [Wh/kg]. 

Electromobility Electromobility is a mobility concept 
in which electric vehicles instead of vehicles 
powered by internal combustion engines are used. 
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Glossary 

Fuel cell A fuel cell is an electrochemical cell that can 
convert the chemical energy stored in a given fuel 
into electrical energy. 


The actual energy demand and consumption issues 
make it necessary to critically discuss and compare 
different energy conversion and storage systems. At 
present, only one third of the primary energy is 
converted into end energy, for example, electrical 
energy. Losses are associated with a high consumption 
of fossil fuels and large C0 2 emissions. They can be 
avoided by considering important electrochemical pro¬ 
cesses for energy conversion, using batteries, fuel cells, 
supercapacitors and electrochemical photovoltaics and 
by incorporating energy storage, employing recharge¬ 
able batteries, supercapacitors, generation of hydrogen 
via electrolysis, and generation of methanol. 

Fuel cells can offer clean power generation and have 
the potential to convert fuels directly into electrical 
energy with high efficiencies. Today, however, they 
cannot compete with heat engines because of much 
higher costs, inferior power performance, and insuffi¬ 
cient durability and lifetime. As yet, no single electro¬ 
chemical power source can match the characteristics of 
the internal combustion engine. A competitive system 
in comparison to heat engines can be envisioned when 
available electrochemical power systems are combined. 
In such hybrid electrochemical power systems, batte¬ 
ries and/or supercapacitors would provide high power 
and the fuel cells would deliver electricity with high 
efficiency. The consequence would be to considerably 
reduce costs of the electrochemical systems. 
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Moreover, when we compare different systems, we 
face the genuine problem of geometry and thus the 
costs related to the space needed for the processes 
involved. The latter explains the so far success of ther¬ 
modynamic-based systems (e.g., heat engines) com¬ 
pared to alternative energy technologies (e.g., fuel 
cells). This is also evident when the power output of 
a coal power plant which lies in the GW range is 
compared to the power that a fuel cell can deliver 
which lies in the MW range. A possible strategy would 
be to develop highly efficient fuel cell systems that can 
push their power to larger outputs. Current research is 
aiming at increasing the power output of a fuel cell 
without compromising its efficiency; however, the 
advantages of conventional systems cannot be ignored. 

Introduction 

Our present situation concerning major (fossil) energy 
carriers and energy consumption is characterized by 
limited reserves and resources as well as emission prob¬ 
lems, considering crude oil, natural gas, coal, and ura¬ 
nium as primary energy sources. At the same time, the 
worldwide demand for electrical energy has increased 
from 8.3 million GWh in 1980 to 18.9 million GWh in 
2006 and is estimated to further increase up to 30.7 
million GWh in 2030 (see [1] (Fig. 1). 
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Figure 1 

Worldwide demand for electrical energy (From [1]) 


The problem of the energy management is of 
a rather complex nature. Increasing contributions of 
renewable energies such as wind power, solar power, 
and wave power tend to complicate the grid manage¬ 
ment. Therefore, generation of electrical energy is only 
part of the challenge. The management and storage of 
electrical energy will become essential in order to main¬ 
tain the present grid quality [2]. 

Energy conversion processes today are under spe¬ 
cial consideration because of two issues associated with 
them: 

1. The limited availability of primary energy carriers 

2. The emission of pollutants with local and global 

negative effects on the environment 

On average, energy conversion processes aiming 
at generating electrical power reveal efficiencies not 
much higher than 30%. This indicates that losses in 
the form of heat or chemical substances amount 
to more than two thirds of the primary energy. Con¬ 
ventional processes, for example, those in a heat 
engine-based power plant, are volume processes such 
as combustion, which results in mechanical and then in 
electric energy. By contrast, other technologies such as 
photovoltaics, batteries, fuel cells, or supercapacitors 
are based on interfacial transfer of energy and/or 
charge. In contrast to the performance of heat engines 
which is limited by the Carnot efficiency, interfacial 
reactions are usually of much higher thermodynamic 
efficiency. 

Classical Heat Engines 

Thermodynamic considerations lead to the first heat 
engines that were used to generate mechanical or electri¬ 
cal energy. Since the eighteenth century, conventional 
reciprocating steam engines have served as mechanical 
power sources, with notable improvements being made 
by James Watt. The first commercial central electrical 
generating stations in New York and London, in 1882, 
used steam engines [3]. These first-generation heat 
engines are still serving as power plants today. 

Further developments lead to the introduction of 
combined cycle power plants. Here, usually 
a combination of several cycles, operating at different 
temperatures yields considerably higher system effi¬ 
ciency. Heat engines are only able to use a portion of 
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the energy, usually 35-41%. The remaining heat is 
generally wasted. 

In a combined cycle power plant (CCPP), or com¬ 
bined cycle gas turbine (CCGT) plant, a gas turbine 
generator produces electricity and the waste heat is 
used to make steam for generating additional electricity 
via a steam turbine; this last step enhances the effi¬ 
ciency for electricity generation to about 60%, because 
the temperature difference between the input and out¬ 
put heat levels is higher leading to an increase in the 
Carnot efficiency. Most modern power plants in 
Europe and in North America are of this type. 

If the waste heat of a conventional thermal power 
station is utilized for district heating, it is called cogen¬ 
eration. This heat gives an additional efficiency of 
about 40-50% which leads to an increase of the overall 
efficiency to ~90%. The fuels used in these engines 
comprise black and brown coal and nuclear power. 

In contrast to the stationary systems described above, 
mobile engines are needed in many energy production 
applications. The best known mobile engines are the 
four-stroke (Otto), the diesel, and the Stirling engine. 

Heat engines have limited efficiencies which are 
determined by the Carnot cycle. Practical issues reduce 
the efficiency of steam engines, due to limits of convec¬ 
tive heat transfer and viscous flow (friction). There are 
also mechanical considerations, for example, limita¬ 
tions imposed by the materials such as nonideal prop¬ 
erties of the working gas, thermal conductivity, tensile 
strength, creep, rupture strength, and melting point. 

Electrochemical Systems 

Energy Conversion and Storage 

Alternatively to conventional Carnot-based heat 
engines, electrochemical systems have the potential to 
be employed to generate and store energy. Important 
electrochemical processes for energy conversion are 
considered in batteries, supercapacitors, electrochemi¬ 
cal solar cells, and fuel cells, and energy storage is 
incorporated by employing rechargeable batteries, 
supercapacitors, generation of hydrogen via electroly¬ 
sis, and generation of methanol from electrochemically 
generated hydrogen and C0 2 /C0-containing synthetic 
gas. Electrochemical energy conversion systems are not 
subject to the Carnot cycle limitations and may operate 
with much higher efficiencies than combustion engines 


and related devices. Heat engine cycle processes are 
volume processes, whereas all electrochemical systems 
that are used for energy conversion and storage are 
based on reactions that take place at an interface. This 
makes them dependent on the surface morphology and 
its physical and chemical properties. 

A fuel cell is an electrochemical conversion device. It 
produces electricity from fuel on the anode side and an 
oxidant on the cathode side, which react in the presence 
of an electrolyte. Fuel cells can operate continuously as 
long as the necessary flows of reactants and reaction 
products are maintained. They are thus thermodynam¬ 
ically open systems. Many combinations of fuel and 
oxidant are possible. A hydrogen cell uses hydrogen as 
fuel and oxygen, usually from air, as oxidant. Other 
fuels include hydrocarbons and alcohols. Other oxi¬ 
dants include chlorine and chlorine dioxide. The prin¬ 
ciple of fuel cells is already known for 165 years. In 
1838, Christian Friedrich Schonbein discovered that an 
electrical voltage is measured between two platinum 
wires in an electrolyte solution surrounded by hydro¬ 
gen and oxygen. He published these results under the 
title “On the Voltaic Polarization of Certain Fluid and 
Solid Substances.” Sir William Robert Grove investi¬ 
gated the new effect intensely and was the first to 
develop useful fuel cells. He connected several elements 
in series and called these systems “gas batteries.” The 
further development of this concept to a really efficient 
electrical source proved to be so difficult that it took 
100 years until it improved essentially. After the design 
of the first electrical dynamo by Werner von Siemens in 
1867, electrical generators were energy sources that 
efficiently delivered electricity in almost unlimited 
amounts. This was the reason why fuel cells were prac¬ 
tically replaced completely. Fuel cells have the potential 
for high conversion efficiencies of chemical energy into 
electrical energy; at the moment the attainable effi¬ 
ciency is up to 65%, depending on fuel and conditions. 
Fuel cells can substitute other technologies or be oper¬ 
ated in synergy, for example, with combustion engines 
in cogeneration of electricity and of heat and cold, in 
residential or mobile applications. 

Figure 2 shows a comparison of the thermodynamic 
efficiencies of a hydrogen/oxygen fuel cell and of 
a Carnot cycle. In the future, fuel cells might be able 
to convert the used fuels into electrical energy with 
efficiencies of >70%. The difference between the 




3850 


F 


Fuel Cell Comparison to Alternate Technologies 



b 




>50% 

Fuel | 



O v 7 00% 

~ 40 % 

V 

xy 

Hot 

gas 

^ i 

<> 

Turbine 


N. 


- 100% 


Electrical 

energy 


<f 


Chemical Thermal Mechanical Electrical 
energy energy energy energy 


Fuel Cell Comparison to Alternate Technologies. Figure 2 

(a) Ideal thermodynamic efficiency of polymer electrolyte membrane fuel cells (PEMFCs) compared to that obtained in the 
Carnot process, (b) Comparison of processes in a cogenerated heat engine with fuel cell performance (From [2]) 


theoretical and practical energy storage capabilities is 
related to several factors, including (1) inert parts of the 
system such as conductive diluents, current collectors, 
containers, etc., that are necessary for its operation, 
(2) internal resistances within the electrodes and elec¬ 
trolyte and between other cell components, resulting in 
internal losses, and (3) limited utilization of the active 
masses, as, for example, parts of the fuel in a fuel cell 
leave the cell without reaction or as, for example, pas¬ 
sivation of electrodes makes them partially electro- 
chemically inactive. Fuel cells offer a very clean power 
generation. They are quiet in operation and can be 
located close to the application. They produce much 
less greenhouse emissions and can be more efficient in 
conversion of the energy in a fuel into power than 
gasoline engines or utility thermal power plants. Fuel 
cells are best utilized as a steady energy source and not 
as a power source to supply dynamic demands. For 
applications that require varying power demands, 
such as automotive propulsion, the use of the fuel cell 
in a hybrid configuration with a battery or an electro¬ 
chemical capacitor is envisioned. The fuel cell provides 
steady power delivery while the battery or the electro¬ 
chemical capacitor handles the surge for regenerative 
breaking and acceleration as well as initial start-up. 
There is a certain number of fuel cells under development. 
Usually fuel cells are classified by the electrolyte that 
they are using. The electrolyte is a major component of 
the fuel cell since it determines the operating tempera¬ 
ture, fuel, oxidant, poisons, catalysts, electrode design, 


and system design. According to the type of electrolyte 
that is used, fuel cells can be classified into six major 
categories. 

The polymer electrolyte membrane fuel cell 
(PEMFC) uses a solid-type electrolyte that typically can¬ 
not withstand high temperatures, and the relatively low 
operating temperature results in slow reaction rates. The 
operating temperature of a PEMFC is typically between 
50° C and 100°C. The slow reaction rate is overcome by 
using highly active catalysts such as platinum. The fuel 
utilized is pure hydrogen and the electrical power that it 
produces is typically less than 250 kW. The fuel 
and oxidant must be free of carbon monoxide, which 
is a poison because it gets strongly adsorbed by the 
catalyst reducing the active surface area. The main appli¬ 
cations for PEMFCs are for small stationary units and as 
power supplies in vehicles. The direct methanol fuel cell 
(DMFC) is similar to the PEMFC but methanol is used 
as a fuel instead of hydrogen. It can be used for portable 
applications such as laptops, cell phones, and MEMS 
devices. Maximum power of DMFCs is typically less 
than 10 kW. 

Figure 3 shows a typical polarization curve of a low- 
temperature PEMFC. The curve is similar for all the 
other types of fuel cells as same trends are observed. At 
high operation voltages, the power output of a fuel cell 
is controlled mainly by the electrocatalytic properties 
of the catalyst. In this region (>0.8 V), the current 
density is controlled by the charge transfer and the 
dissipated heat is minimized. Challenges in fuel cell 
















Fuel Cell Comparison to Alternate Technologies 


F 


3851 



> / [A cm 2 ] 



Fuel Cell Comparison to Alternate Technologies. Figure 3 

Conversion of chemical energy in fuel cells (voltage efficiency ?7voitage versus current density /). Black line : cell voltage U; 
gray line: power density P 


catalyst research are to obtain high electric power den¬ 
sity, high electric conversion efficiency, and low mate¬ 
rial costs. At high electric power density, mass transport 
is the dominant factor affecting the fuel cell operation. 
This region can be expanded by careful engineering 
design of the fuel cell system. A general research trend 
is to obtain as high current densities as possible at high 
operation voltages. 

For high electric conversion efficiency , one should 
aim at the use of less cogenerated heat which would 
also allow for a simplification of the design and the 
need of less fuel (hydrogen) for the same electrical 
energy. To guarantee low material costs, less noble 
metal should be incorporated, because high catalyst 
utilization is needed, and highly efficient catalysts are 
needed for anode and cathode side. The goal in fuel cell 
research must therefore be a deeper understanding of 
the parameters controlling electro catalytic activity 
which would enable us to propose rational structures 
for catalysts in fuel cells in the future. 

Alkaline fuel cells (AFCs) were used to provide 
electrical power for many manned spacecrafts. The 
electrolyte is KOH and the catalysts include silver, 
nickel, and different metal oxides. AFC catalysts are 
relatively inexpensive compared to catalysts used for 
other types of fuel cells. The fuel and oxidant used in an 
AFC must be completely free of C0 2 since even a small 


amount reacts strongly with the electrolyte, producing 
forms of carbonates that poison the ionic conductivity 
of the electrolyte. Therefore, pure hydrogen and oxygen 
must be used, limiting the use of the AFC to special 
applications, like spacecrafts and submarines, where 
cost of the fuel and oxidant is not a major issue. 

The phosphoric acid fuel cell (PAFC) was the first 
fuel cell to be commercialized and shares some tech¬ 
nologies with the PEMFC, such as the porous elec¬ 
trodes and the platinum catalysts. The liquid 
phosphoric acid allows high operating temperatures, 
around 200° C. Fuels must be free of carbon monoxide, 
as with the PEMFCs. With rated power over 50 kW, 
PAFC systems are used for stationary applications. 

Molten carbonate fuel cells (MCFCs) operate at 
temperatures around 650°C. The electrolyte is a mix¬ 
ture of molten carbonate salts, such as lithium and 
potassium carbonate. At the operating temperature, 
the carbonates melt and become conductive to carbon¬ 
ate ions. Due to the high temperature, high reaction 
rates can be achieved with low-cost catalysts such as 
nickel. Rated power for MCFC systems is typically 
greater than 1 MW. 

The last fuel cell system is the solid oxide fuel cell 
(SOFC). Typical operating temperature for the SOFC is 
between 600° C and 1,000°C. It has the same advantages 
of the MCFC (high reaction rates, inexpensive catalysts, 
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and natural gas can be used as a fuel). The electrolyte is 
in solid form, the most common kind being yttria 
(Y 2 0 3 ) stabilized with zirconia (Zr0 2 ), which is 
a ceramic and therefore makes the fabrication process 
complex and difficult. The power output can be greater 
than 200 kW, making the SOFC ideal for large station¬ 
ary applications, such as power stations. 

High-temperature fuel cells such as molten carbon¬ 
ate fuel cells (MCFC) or solid oxide fuel cells (SOFC) 
deliver off heat at high temperature level (T > 600° C). 
This can be used in high-temperature fuel cell cogene¬ 
ration for district heating (heat extraction), for absorp¬ 
tion refrigerators and heat pumps. As the energy 
demand in residential applications is complementary 
for heating and for cooling over the year, such a com¬ 
bination would be very useful. 

A battery or voltaic cell consists of one or more 
electrochemical galvanic cells which store and convert 
chemical energy into electric energy. Since the inven¬ 
tion of the first Voltaic pile in 1800, the battery has 
become a common power source for many household 
and industrial applications. The name “battery” was 
coined by Benjamin Franklin for an arrangement of 
multiple Leyden jars, an early type of capacitor. There 
are different types of batteries which comprise non- 
rechargeable batteries (primary batteries) in which 
chemical energy is stored internally, the discharging 
reaction, that takes place at the electrode/electrolyte 
interface, is irreversible, and rechargeable batteries or 
accumulators (secondary batteries) having a reversible 
discharging reaction [4]. 

In the nineteenth century, no difference was made 
between batteries and fuel cells. Today the definitions 
have been refined: batteries convert and store electrical 
energy and hence represent a thermodynamically 
closed system; energy storage and conversion occur in 
the same compartment. Fuel cells are open systems 
where the anode and cathode are just charge-transfer 
media and the active masses undergoing the redox 
reactions are delivered from outside the cell, either 
from the environment, for example, oxygen from air, 
or from a tank, for example, fuels such as hydrogen and 
hydrocarbons. Energy storage (in the tank) and energy 
conversion (in the fuel cell) are thus locally separated. 

Electrochemical capacitors (EC), also called 
supercapacitors, store energy using either ion adsorp¬ 
tion or fast surface redox reactions. In the first case, 


they are called electrochemical double layer capacitors 
(EDLC ), in the second case they are called pseudoca- 
pacitors. They can complement or replace batteries in 
electrical energy storage and harvesting applications, 
when high power delivery or uptake is needed. Several 
types of ECs can be distinguished, depending on the 
charge storage mechanism as well as the active materials 
used. EDLCs, the most common devices at present, use 
carbon-based active materials with high surface areas. 
Pseudocapacitors or redox supercapacitors use fast and 
reversible surface or near-surface reactions for charge 
storage. Transition metal oxides as well as electrically 
conducting polymers are examples of pseudocapacitive 
active materials. Electrochemical capacitors currently fill 
the gap between batteries and conventional solid state 
and electrolytic capacitors. They store 100-1,000 times 
more charge than the latter due to a much larger surface 
area (1,000-2,000 m 2 g -1 ) available for charge storage 
in EDLC. They have a lower energy density than batte¬ 
ries, and this limits the optimal discharge time to less 
than a minute, whereas many applications clearly need 
more [5] . Since the early days of EC development in the 
late 1950s, there has not been a good strategy for 
increasing the energy density; only incremental perfor¬ 
mance improvements were achieved from the 1960s to 
1990s. The increase in performance that has been dem¬ 
onstrated in the past few years is due to the discovery of 
new electrode materials and improved understanding 
of ion behavior in small pores, as well as the design of 
new hybrid systems combining faradic and capacitive 
electrodes [6]. 

To compare the power and energy capabilities 
of fuel cells, batteries, and supercapacitors, a rep¬ 
resentation known as the Ragone plot has been devel¬ 
oped. The terms specific energy [Wh/kg] and energy 
density [Wh/L] are used to compare the energy con¬ 
tents of a system, whereas the rate capability is 
expressed as specific power [W/kg] and power density 
[W/L]. A simplified Ragone plot (Fig. 4) discloses that 
fuel cells can be considered to be high-energy systems, 
whereas supercapacitors are considered to be high- 
power systems. Batteries have intermediate power 
and energy characteristics. There is some overlap in 
energy and power of supercapacitors or fuel cells with 
batteries. Batteries with thin-film electrodes exhibit 
power characteristics similar to those of super 
capacitors. 
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Figure 4 

Simplified Ragone plot of the energy storage domains for 
the various electrochemical energy conversion systems 
compared to an internal combustion engine and turbines 
and conventional capacitors (From [7]) 



Not all systems can be described using these defini¬ 
tions, since there are also combined systems such as 
metal-air batteries [8-10] which contain a battery elec¬ 
trode (metal anode) and a fuel cell electrode (air cath¬ 
ode), or redox flow batteries [11, 12] which are a form 
of rechargeable battery in which electrolyte containing 
one or more dissolved electroactive species flows 
through an electrochemical cell. Additional electrolyte 
is stored externally, generally in tanks, and is usually 
pumped through the cell of the reactor. The best known 
example is the chromium-iron battery [13]. The active 
masses are represented by Fe 3+ and Cr 2+ ions. In these 
redox storage devices, the mechanical aging of the 
accumulator electrodes is prevented. Flow batteries 
can be rapidly recharged by replacing the electrolyte 
while simultaneously recovering the spent material. 
Due to the overlapping properties of batteries and 
fuel cells, it is nowadays considered to find 
a definition that recombines the two systems again, to 
no longer differentiate between batteries and fuel cells 
like in former times. 

In order to bridge the gap between batteries and 
electrochemical capacitors, a capacitive or pseudoca- 
pacitive electrode can be combined with a battery elec¬ 
trode. Batteries exhibit usually high energy densities, 


while capacitors can deliver high power at the cost of 
low energy storage. Research efforts have been focused 
to increase the gravimetric energy of electrochemical 
capacitor electrodes by using materials with enhanced 
gravimetric capacitances (gravimetric charge storage 
per volt) [14]. The latter can be achieved by using 
carbon subnanometer pores for ion adsorption or by 
incorporating the pseudocapacitance of nanostruc- 
tured transition metal oxides. A recent report by S.W. 
Lee et al. [15] demonstrated that by using a novel class 
of electrodes for lithium storage high gravimetric 
energy (200 Wh kg electrode _1 ) was delivered at a high 
power of 100 kW kg e i ectrode _1 . Those electrodes were 
based on functionalized multiwalled carbon nanotubes 
(MWNTs) that included stable pseudocapacitive func¬ 
tional groups and were assembled using a layer-by- 
layer technique. Therefore, it was shown that it is pos¬ 
sible to benefit from both the capacitor and the battery 
properties. 

A critical parameter of fuel selection is its energy 
density. The latter should be as high as possible. Addi¬ 
tionally, the choice of fuel is also affected by the ease of 
transportation and storage which in general should be 
as easy and uncomplicated as possible. The choice of 
the fuels determines which problems, including envi¬ 
ronmental problems, will arise. 

Figure 5 shows the theoretical specific energies 
[(kWh)/kg] and energy densities [(kWh)/l] of various 
rechargeable battery systems in comparison to fuels, 
such as gasoline, natural gas, and hydrogen. The infe¬ 
riority of batteries is evident. These values are an indi¬ 
cation for the maximum energy content of certain 
systems. The practical values are significantly lower 
than the theoretical ones. A rechargeable battery has 
a practical energy content which is 25% of its theoret¬ 
ical value while a primary battery normally yields more 
than 50% of its theoretical value. The theoretical effi¬ 
ciencies of fuel cells for the conversion of fuels into 
electrical energy are higher than 70%. A number of 
factors influence the observed differences between the¬ 
oretical and practical energy storage capabilities. The 
systems contain inert parts such as conductive diluents, 
current collectors and containers, and internal resis¬ 
tances which results in internal losses and limited uti¬ 
lization of the active masses. Parts of the fuel can, for 
example, leave the fuel cell without having reacted or 
partial passivation of the electrodes can make parts of 
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Fuel Cell Comparison to Alternate Technologies. Figure 5 

Theoretical specific energies [k Wh/kg] and energy densities [k Wh/I] of various rechargeable battery systems compared to 
fuels, such as gasoline, natural gas, hydrogen, and alcohols 


the surface area inactive. For practical use, however, as 
batteries and fuel cells are not subject to the Carnot 
cycle limitations, they may operate with much higher 
efficiencies than combustion engines and related 
devices. The efficiencies of PEMFCs, batteries, and 
supercapacitors are summarized for comparison in 
Table 1. 

Electrochemical photovoltaics are a specific type of 
photovoltaic systems to convert solar energy, that is, 
solar radiation into electrical energy. Today, there is 
a variety of solar cells already on the market. Difference 
must be made between electrochemical photovoltaic 
cells and conventional silicon (Si)-based photovoltaic 
cells; in electrochemical photovoltaic cells two inter¬ 
faces exist at which charge transport switches from 
electronic to ionic and back. In photoelectrochemistry 
a photoactive semiconducting working electrode is 
used together with a metallic counter electrode. Both 
are immersed in an electrolyte containing a suitable 
redox couple. Upon radiation of the semiconductor- 
electrolyte interface with light having a higher energy 


Fuel Cell Comparison to Alternate Technologies. 
Table 1 Practical efficiencies of fuel cells, batteries, and 
supercapacitors 


System 

Process 

Efficiency 

PEMFCs 

Electrolysis- 
H 2 storage - 
FC operation 

40% 

Batteries 

Charging - 
Discharging 

80% 

Supercapacitors 

Charging - 
Discharging 

95% 


than the bandgap of the semiconductor, electron-hole 
pairs are formed and separated. Photogenerated 
majority carriers accumulate at the back side of the 
semiconductor, minority carriers travel to the semicon¬ 
ductor-electrolyte interface. The majority carriers are 
collected with a charge-collecting substrate and 
transported to the counter electrode where they 
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electrochemically react with the redox couple in the 
electrolyte. In the 1970s, wet-type electrochemical pho¬ 
tovoltaic cells were discovered [ 16-18] . Michael Gratzel 
demonstrated that dye molecules can be adsorbed 
onto nanocrystalline Ti0 2 working electrodes and 
thus extended the concept to dye-sensitized solar cells 
(DSSCs) [19-22]. 

It is noteworthy that electrochemical photovoltaic 
cells without dyes have working (semiconductor) and 
counter electrodes that are both immersed in the redox 
electrolyte. After the formation of electron/hole pairs, 
specific reactions occur at the semiconductor electrode 
and at the metal counter electrode. Charge balance is 
maintained via oxidation and reduction processes, 
and electrodes are often instable in aqueous media. 
Photoelectrochemical photovoltaic cells could only 
replace Si-based photovoltaics if a stable semiconduc¬ 
tor material with a band gap of ^1.4 eV is found 
[23-26]. In dye-sensitized solar cells, a dye-sensitized 
porous nanocrystalline Ti0 2 photoanode with a 1,000- 
fold surface area compared to flat electrodes has been 
used. Furthermore, the charge transport of the 
photogenerated electrons passing through all the par¬ 
ticles and grain boundaries is highly efficient [27]. The 
conversion efficiency of DSSCs has improved to 11.5% 
[28] since it was first reported with an efficiency of 
7.1% [26]. Their efficiencies are now comparable 
with amorphous silicon solar cells [29] but at a much 
lower cost. 

Electromobility: An Example for Combining Key 
Energy Technologies 

Concerns about global warming and the depletion of 
petroleum resources demand a shift in automotive trans¬ 
portation from petroleum-based internal combustion 
engines to electrically driven vehicles (EVs). In the con¬ 
cept of electromobility, all the above-mentioned technol¬ 
ogies should be united and combined in order to 
synergistically provide power to an EV. 

Hybrid electric vehicles are a first step, where an 
internal combustion engine is assisted by a battery- 
driven electric motor, enabling increased fuel efficien¬ 
cies when using high-power and low-energy (ca. 1 kWh) 
batteries. Further reduction or even elimination of 
petroleum-based fuels can be achieved by successively 
increasing the onboard stored electrical energy, thus 


enabling plug-in hybrid electric vehicles, short-range 
electric vehicles and, ultimately, long-range electric 
vehicles. Such a development would be the perfect 
solution to realize electric traction in general. 

Together with batteries and supercapacitors, fuel 
cells are a key component that must be investigated in 
order to build electric vehicles suitable for continuous 
use, especially in public transport. Fuel cells have the 
potential to offer clean power generation, are quiet in 
operation, and can be located close to the application. 
They produce much less greenhouse emissions and can 
be more efficient in conversion of the energy in a fuel 
into power than gasoline engines or utility thermal 
power plants. Fuel cells are best utilized as a steady 
energy source and not as a power source to supply 
dynamic demands. For applications that require vary¬ 
ing power demands, such as automotive propulsion, 
the use of the fuel cell in a hybrid configuration with 
a battery or an electrochemical capacitor is required. 
The fuel cell provides steady power demand while the 
battery or the electrochemical capacitor handles the 
surge for regenerative breaking and acceleration as 
well as initial start-up. This makes fuel cells an essential 
component for hybrid vehicles with adequate ranges as 
they are required for public transport and goods traffic. 

Fuel cells, batteries, and supercapacitors have 
a similar configuration, that is, two electrodes sepa¬ 
rated by an electrolyte. The latter are designed for 
high power and long life service. Currently nonetheless, 
the status of batteries, supercaps, and fuel cells does not 
allow for a satisfactory use in electric vehicles due to 
tremendous weight and safety issues, too high costs, 
and too short ranges. Thus, fundamental research is 
necessary to develop novel battery, supercapacitors, 
and fuel cell concepts for a new generation of electric 
vehicles. 

Nowadays, also concepts are developed in which 
photovoltaic cells integrated on the roof or in the 
windows of EVs can be used to recharge the batteries. 
It is known from literature that EVs suffer from too 
short ranges and too high weight [7, 30]. Figure 6 
shows typical driving ranges of battery-powered cars 
compared to ranges of cars powered by modern com¬ 
bustion engines. As it can be seen, for EVs powered 
with any secondary battery system (i.e., lead acid, 
nickel metal hydride, sodium sulfur, and lithium ion), 
the driving ranges that they possess are inferior 
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Fuel Cell Comparison to Alternate Technologies. 

Figure 6 

Comparison of the driving ranges for a vehicle powered by 
various battery systems or a gasoline-powered 
combustion engine (From [7]) 


compared to those of conventional combustion 
engines. Therefore, it is illustrated clearly why rather 
fuel cells and not batteries should be considered for 
replacement of combustion engines. 

Fuel cells and batteries can be used to build full 
electric vehicles and thus provide pure electrification 
solutions for different mass and usage segments of 
automotive application. Battery powered EVs based 
on lithium ion technology will be limited to small- 
vehicle low-mileage-per-day applications which is due 
to the low specific energy and long charging times. The 
required 300 mile range has been reached with hydro¬ 
gen fuel cell vehicles and the latter have also been 
proven to be able to operate in all climates. Neverthe¬ 
less, the costs of the Pt-based catalysts needed in a fuel 
cell are too high, the hydrogen infrastructure is not 
present and cost-effective renewable sources of primary 
energy are not yet efficiently used. Drastic decreases in 
the amount of Pt used are required which is put into 
practice via a number of strategies currently under 
development [30]. 

A very promising approach to energy conversion 
in electric vehicles is the use of metal-air batteries, 
for example, Li-air systems. As was discussed in the 
earlier section, the metal-air battery combines fuel cells 
with batteries. In the latter example, substitution of the 
conventional lithium intercalation cathode with an air 
cathode is projected to increase the cathode specific 
capacity by a factor of ~3.5. Additionally, the concept 
of Li-air systems has another major advantage; it can 


drastically decrease the weight of the battery pack. 
Specifically, for the case of short-range EVs (~25 kWh), 
the total battery would weigh only ~40 kg. For the case 
of a full-range EV, the projected Li-air battery-pack 
weight is only 120 kg. This is significantly smaller com¬ 
pared to the minimum weight of 330 kg for a best case 
Li-ion battery pack. Therefore, the Li-air battery tech¬ 
nology would be a very promising and enabling technol¬ 
ogy for full-range EVs. Li-air batteries give promise of 
extending the range, but scientists and engineers will 
have to face many challenges in order to prove the 
research in this area effective. 

The electrical energy required on board to travel 
a certain range before refueling is provided by the 
energy storage/generation system and is determined 
by the total gross energy consumed minus the energy 
recovered from regenerative braking (Wh net /mile). All 
loads on board can be major drains of energy since they 
are powered by the electrical storage/generation system 
which affects the driving range. Loads that severely 
affect the driving ranges are heating and cooling devices 
in the car. The vehicle mass (m) including the mass of 
the powertrain system and of the passengers, the frontal 
area (A), the drag coefficient (C d ), and the drive cycle 
are elements that influence the Wh net /mile requirement 
to the electric motor the most. Furthermore, the rolling 
resistance and the ancillary loads mentioned above 
have strong influence. 

An equation has been developed to correlate the 
different parameters [30]: 

Energy required = aj x m + a 2 x Q x A + a 3 

a 1? a 2 , and a 3 are obtained from modeling results. The 
model is primarily based on the greenhouse gases, 
regulated emissions, and energy use in transportation 
(GREET) software developed by the Argonne National 
Lab [31]. In Table 2 [30], estimates are listed that start 
from a small subcompact car (C d xA=0.6 m 2 ) with 
a total vehicle mass of 1,200 kg for which the energy 
required to drive the Environmental Protection Agency 
(EPA) city or highway cycle is 138 or 156 Wh net /mile, 
respectively. The energy required for a van (CdxA= 
1.56 m 2 ) weighing 2,500 kg is 283 or 361 Wh/mile 
for city or highway cycle, respectively. The energy 
requirement for driving in the city is lower because 
the vehicle has less resistance to overcome at lower 
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Fuel Cell Comparison to Alternate Technologies. Table 2 Electrical energy required as a function of the vehicle type and 
mass (analysis assumes 70% recovery of braking energy through regenerative braking) 


Vehicle type 

Mass (kg) a 

C d xA (m 2 ) 

City (Wh net /mile) 

Highway (Wh net /mile) 

Sub-compact 

1,200 (2) 

0.60 

138 

156 

Compact car 

1,400 (4) 

0.64 

154 

171 

Midsize car 

1,550 (4) 

0.67 

166 

183 

Full-size car 

1,700 (5) 

0.71 

178 

194 

Minivan 

2,200 (7) 

0.93 

224 

250 

Van 

2,500 (8) 

1.56 

283 

361 


Taken from Ref. [30] 

a The mass includes 65 kg passengers weight shown in parentheses 



speed, electricity generation is more efficient at lower 
loads, and the use of regenerative braking preserves 
these advantages for the city cycle [30]. 

The above given examples show that the range 
problem is the most severe problem of batteries used 
to power electric vehicles. Fuel cells have the potential 
to overcome this issue. On one hand, rather cheap 
materials are used in Li ion batteries, while on the 
other hand they require a very large surface area of 
very finely controlled thin layers, interfaces, and sepa¬ 
rators. Li ion batteries are already mass produced for 
use in portable electronic devices and thus, many of the 
opportunities for cost reduction through scale have 
already been taken. Nowadays, the development and 
implementation of improved materials must be the way 
to further reduce costs. 

In fuel cells, however, more expensive materials 
are employed even though their amount can be tre¬ 
mendously reduced. The total geometric surface area of 
the cells shows a 30-fold decrease due to higher current 
densities obtained with more conductive fuel cell 
electrolytes. Fuel cells can be built in a bipolar 
construction with cells stacked in series with the nega¬ 
tive current collector of one cell serving as the positive 
current collector of the adjacent cell. Fuel cells require 
a hydrogen tank and an air compressor which makes 
their balance of plant more complex. 

Strong investment is necessary to develop EVs 
based on the concept of electromobility. The advan¬ 
tages of all available energy conversion and storage 
technologies need to be utilized in order to achieve 


efficient full vehicle electrification. It is very likely that 
both battery and fuel cell technologies will be devel¬ 
oped for automotive application, since batteries are 
well suited for the use in small short-range vehicles 
and fuel cells are able to deliver electric power for 
long-range vehicles. Super capacitors can aid with 
their high power density output whenever it is needed. 
Photovoltaics integrated in the EV can convert the solar 
energy and charge the batteries. The main point is that 
until now no single technology shows the potential 
to monopolize the implementation into an electrical 
vehicle. For fuel cells specifically, research needs to 
focus into identifying novel catalyst materials that can 
reduce the amount of expensive catalysts currently used 
(e.g., Pt). Additionally, fuel cell technologies based on 
alternative fuels other than hydrogen need to be devel¬ 
oped. For all suited energy technologies applicable in 
EVs, the key challenge is their smart combination in 
order to build long-range vehicles which produce less 
greenhouse gas emissions than cars powered by com¬ 
bustion engines. 

Future Directions 

In summary, even though the world is becoming aware 
of an upcoming energy crisis, according to all statistical 
data currently available, most of the worldwide energy 
demand is still delivered via fossil fuels, while renew¬ 
ables unfortunately still account only for a small part. It 
is a fact that use of fossil fuels results in greenhouse gas 
emissions to the atmosphere. For this reason, the 
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industrialized states, under the Kyoto protocol, have 
committed themselves to reduce their greenhouse gas 
emissions by a total of 5% in the period of 2008-2012 
compared to 1990. 

Most of the C0 2 emitted is a result of the inefficient 
conversion and distribution systems that we use. 
The current system efficiencies for energy conversion 
and distribution allow approximately only 30% of 
the primary energy to be delivered as usable energy. 
If we manage to increase the system efficiency, 
C0 2 emissions will be reduced drastically. Therefore, 
there is the urgent need to develop efficient energy 
conversion and distribution systems that emit either 
no or very little C0 2 . A technology that has the poten¬ 
tial to help in the reduction of C0 2 emissions is 
fuel cells. 

Fuel cells have the potential to efficiently convert 
chemical energy to electricity. Even though as discussed 
the concept of fuel cells is not new and research efforts 
worldwide have been devoted to advance the technol¬ 
ogy, there are still many ways that need to be covered 
before bringing them into the market. Some of the key 
issues related to fuel cells that need to be tackled before 
even discussing about a broad commercialization of the 
technology are: 

(a) Low efficiency at high power densities 

(b) Use of hydrogen which is only a secondary energy 

carrier 

(c) Large amount of noble metal required for wide¬ 
spread applications 

(d) Cost and availability of Pt which is commonly used 

as catalyst 

(e) Durability and lifetime of the system 

The first issue is the low efficiencies at high 
power densities of current fuel cell systems. For exam¬ 
ple in PEMFCs, typical power densities delivered 
are ~1 Wcm“ 2 at 0.7 V operating voltage. That 
accounts for a voltage efficiency of ^56% while the 
overall system efficiency is even lower. The power out¬ 
put at the high voltage regime is controlled by 
electrocatalytic properties and suffers from losses 
attributed mainly to the slow kinetics of the oxygen 
reduction reaction. In order to achieve a high-power, 
high-efficiency system, it is necessary to identify and 
develop highly active catalyst materials that could push 
the power output to ^1 Won -2 at 0.85-0.9 V. 


The second issue relates to hydrogen which is the 
employed fuel for low-temperature fuel cells. The 
advantage of using hydrogen is that water is the only 
product. Nonetheless, we need to take into consider¬ 
ation that hydrogen is only a secondary energy carrier 
which is not freely available and is currently produced 
by “non-clean” methods, for example, reforming of 
natural gas. Hydrogen can be produced by regenerative 
sources but those suffer from low efficiencies. It would 
be beneficial to fabricate systems that can directly con¬ 
vert alternative fuels such as bio-derived alcohols which 
have higher energy densities and can be produced effi¬ 
ciently. Direct methanol fuel cells are a starting point 
but at their current status not a satisfying solution. Bio¬ 
ethanol is a very attractive candidate. It can be pro¬ 
duced by cellulosic biomass, has high energy density, 
is nontoxic, and its liquid form allows using the current 
fuel infrastructure. Research should also aim at identi¬ 
fying highly active catalysts toward ethanol oxidation 
and electrolytes with low ethanol permeability that will 
result in designing an efficient system that will directly 
convert ethanol to electricity. 

The last three issues refer to the current material 
systems used in fuel cells. Pt supported on carbon 
(Pt/C) is commonly used in PEMFCs as catalyst and 
in order to achieve desirable power densities high 
amounts of noble metal are required. Even though Pt 
possesses high catalytic activity, it is also associated 
with limited resources, and high price. Efforts to reduce 
the metal loading and alloy Pt with other inexpensive 
and more abundant elements, that is, metals, rare 
earths have shown improvements, however, there is 
no system found that would lead PEMFCs to 
a broader commercialization. Carbon is commonly 
used as support material in several fuel cell systems 
since it is inexpensive with high electronic conductivity 
and high availability. Carbon though is the main reason 
of the low lifetime since it undergoes corrosion. 
Efforts to nanostructure carbon, that is, nanotubes, 
nanofibers, have shown only unsubstantial improve¬ 
ments. Pt and C are materials that have been exten¬ 
sively investigated for many years but have not brought 
fuel cells to the market. Therefore research should focus 
to identify novel catalyst/support systems that would 
replace Pt and C. 

Two possible future scenarios for stationary elec¬ 
tricity production and road traffic can be envisioned. 
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For the first scenario small units (<1 MW) based 
on SOFC with natural gas, biogas, or ethanol could be 
implemented. In niche markets, electricity storage 
could be achieved with electrolysis to produce hydro¬ 
gen that could subsequently be stored and then used 
in a fuel cell system. For larger units (> 10 MW), SOFCs 
could be used with natural gas, biogas, carbon from 
biomass. For the latter, a negative C0 2 effect is possible 
(electricity will decrease C0 2 concentration in the 
atmosphere). For the second scenario of road traffic, 
two options are possible. City travel could be handled 
with battery-operated motor bikes and small cars, pos¬ 
sibly in combination with a small (1-5 kW) fuel cell. 
Long-range travel could be achieved with larger vehi¬ 
cles using mid-temperature (200-300°C) bioethanol 
fuel cells hybridized with batteries. In both systems, 
supercapacitors could be employed to handle the high 
power needs. For both scenarios, all other energy 
conversion technologies (e.g., solar, wind) can be used 
in order to convert and subsequently store energy 
(i.e., battery recharging). 

In conclusion, higher targets need to be set for the 
future and achieved before fuel cells can be a com¬ 
mercially viable technology. There have been numerous 
efforts to improve fuel cells but all solutions proposed 
so far are only small steps that address partially 
only one of the above-mentioned issues. High-risk, 
high-profit solutions are required that will drastically 
improve the current fuel cell status and will lead them 
to their broad commercialization. In fuel cell develop¬ 
ment, we need to match the available energy source 
with the application. Hydrogen does not seem to be an 
option. A direct conversion of an available fuel in a fuel 
cell would be more desirable. For example, alcohol 
oxidation at all temperatures and (hydro-) carbons 
could be used at high temperatures. In all situations, 
fuel cell research has to acknowledge the fuel problem. 
Therefore, if one believes that we are close to the end of 
research for fuel cells and other energy technologies 
then the reality is different ... we are only at the 
beginning. 
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Glossary 

Balance of plant (BOP) Auxiliary components that 
complement a power unit as a functional or oper¬ 
ational unit. 

Electric vehicle (EV) A vehicle that uses battery or any 
other onboard electrical power source to operate 
the vehicle. 

Fuel cell A device that uses fuel and oxidant in 
an electrochemical reaction to generate direct 
electricity. 

High-temperature proton exchange membrane fuel 
cell (HT-PEMFC) A PEM fuel cell that operates 
between 120°C and 200°C. 

Hybrid electric vehicle (HEV) A vehicle that uses an 
auxiliary electrical power source, such as a fuel cell 
or a battery, along with an internal combustion 
engine (ICE) for propulsion. 

Internal combustion engine (ICE) A device that con¬ 
verts the chemical energy of fuel in a cylinder to 
mechanical work through the piston, connecting 
rod and crank shaft mechanisms. 

Low-temperature PEM fuel cell (LT-PEMFC) A PEM 
fuel cell that operates between 50°C and 80°C. 
Proton exchange membrane (PEM) A fuel cell that 
uses either a sulphonated (low temperature) mem¬ 
brane or polybenzimidazole (high temperature) 
membrane to generate direct electricity. 

Reformate A fuel derived from fossil fuel (natural gas) 
after reformation that is a mixture of hydrogen, 
carbon monoxide, and carbon monoxide. 

Solid polymer fuel cell (SPFC) Same as PEM above. 

Definition of the Subject 

A hybrid vehicle implies a vehicle that is powered by 
more than one source of power. It can be an internal 
combustion engine and a battery, an ICE and fuel cell, 
and ICE and solar cells or fuel cell and batteries. The 
current trend of hybridization of internal combustion 
engines (ICE) is a result of the inefficiency of ICE. 
Spark ignition internal combustion engine, SI-ICE, 
is about 25-27% efficient, while compression ignition 
internal combustion engine, CI-ICE, (diesel) is 
34-38% efficient [1, 2]. The remaining percentage 
of the input energy into ICE is expended in the form 
of heat loss to the coolant and exhaust, by convec¬ 
tion, conduction, and radiation, and by combustion 
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inefficiency. The electric drive has a higher efficiency 
around 90%, as such, it is justifiable to incorporate 
electric drive into the hybridized powertrain. With the 
increased use of electronics in cars for various applica¬ 
tions, such as visual display of GPS, fuel economy, 
maintenance schedule and diagnostics, voice activation 
of devices and DVD players, etc., there is a need to 
use additional power sources to maintain this level 
of power demand without compromising the power 
required for ignition, and operate the electronic control 
unit (ECU), anti-brake systems (ABS), and the fuel 
injection units. 

In the hybridized ICE-powered vehicle, in the 
series mode of connection, the mechanical drive 
shaft that transmits power from the crankshaft 
through the gearbox to the wheels is replaced with 
a generator and electric wires to the electric motors 
spinning the wheels. The design configuration of the 
ICE-battery hybrid presents the base design of other 
complex hybrid vehicles architecture. The design 
structure of the fuel cell-powered hybrid electric vehi¬ 
cle includes fuel cell as the primary power source and 
a battery as a secondary power source for start-up, 
lighting, and power electronics support. The power 
transmission shaft reminiscent of the internal com¬ 
bustion engines (ICE)-powered vehicle is absent, 
rather electric motors spin the four wheels. Fuel cells 
generate direct electricity that is used to drive the 
electric motors in hybrid vehicles. As such, reliability 
is enhanced as there are fewer rotating components. 
The complexity and maintenance are reduced in elec¬ 
tric machinery compared with mechanically driven 
machinery. The following report will present different 
types of fuel cells and their electrolytes, sources 
of fuels for fuel cells, those applicable to the transpor¬ 
tation sector and the hybrid vehicle architecture. 
The ICE hybrid will also be presented as a base 
model, since its operation is well understood by the 
general public. 

Introduction of Fuel Cells 

Fuel cells are electrochemical electricity-generating 
devices that oxidize the fuel at the anode to electrons 
and protons and reduce oxygen at the cathode to produce 
heat and water. The splitting of hydrogen into protons 
and electrons is done at the catalyst layer (anode). 


The electrons move across the external circuit generat¬ 
ing electricity, while the protons permeate through the 
PEM to the cathode. The proton exchange membrane 
fuel cells are of interest in the mobility sector as it is 
modular (scalable) and, as such, can be built to desirable 
sizes without compromising the efficiency or perfor¬ 
mance. Various petroleum and nonpetroleum fuels, 
chemical compounds, and hydrogen carriers can be 
used as fuels in fuel cells after preprocessing. However, 
hydrogen is the most preferred fuel with the end prod¬ 
ucts, in case of the proton exchange membrane (PEM) 
membrane being water, with the release of heat. Hydro¬ 
gen is widely used in most fuel cells ranging from the 
low to the high-temperature PEM to solid oxide fuel 
cells (SOFCs). 

Commonly Used Fuels in Fuel Cells 

Fuels commonly used in fuel cells are: hydrogen, 
methanol, potassium hydroxide; phosphoric acid; 
methane (natural gas); carbon monoxide; carbonate 
salts of sodium, lithium, and potassium; and yttrium- 
stabilized zinc oxide as the electrolyte. Table 1 sum¬ 
marizes various fuels and applicable fuel cells that 
utilize them. 

Low-Temperature and High-Temperature PEMs 

The low temperature PEM fuel cell shown in Fig. 2a 
relies on membrane humidification for ion transport 
from the anode to the cathode through Grotthuss 
mechanism or ion-hopping mechanism. If the mem¬ 
brane is not well humidified or is dry, vehicular 
mechanism is involved in the ion transport. For the 
high-temperature PEM fuel cell based on polybenzi¬ 
midazole (PBI) membrane, which is presented in 
Fig. 2b, the membrane is treated in sulfuric or phos¬ 
phoric acid. Protons are transported in an acidic 
medium when the temperature is raised up to 130°C 
and above. The membrane performs much better at an 
elevated temperature of 170-200°C. Precautionary 
measures require that the membrane be raised to 
above 130°C before the load is applied. Better results 
are obtained if the cell is allowed to run and equilibrate 
at a reduced load for about 100 h before readings are 
taken. This allows the membrane to equilibrate. During 
the shutdown period, the membrane should be purged 




3862 


F 


Fuel Cell Powered HEV Design and Control 


Fuel Cell Powered HEV Design and Control. Table 1 Different types of fuel cells 


Parameters 

PEM 

LT-PEM 

HT-PEM 

DMFC 

AFC 

PAFC 

MCFC 

SOFC 

Fuels 

H 2 

H 2 , Reformates 

Methanol 

KOH (H 2 ) 

P0 4 (H 2 ) 

h 2 , CO, ch 4 

h 2 , CO, ch 4 

Charge carrier 

H + 

H + 

H + 

OH 

H + 

co 3 2 - 

O” 

Electrolyte 

PEM 

PBI 

PEM 

30-50% 

cone. KOH in water 

100% cone, 
in P0 4 

a 

b 

Temperature 

range 

50-80°C 

130-200°C 

50-80°C 

200°C 

150-220°C 

650°C 

650-1,000°C 


LT-PEM low-temperature PEM that are made of poly-sulphonated membranes, such as NAFION marketed by Du Pont, GORE membrane, 
ASAHI glass, 3M, etc. 

FIT-PEM High temperature membrane, such as polybenzimidazole (PBI) membranes made by BASF (USA), etc. 
a Liquid molten carbonate salts, such as Na 2 C0 3 and Li 2 C0 3 , in a ceramic matrix of LiAI02 
b Solid oxide ceramic (Yttrium-stabilized zirconium (Zr0 2 /Y 2 0 3 ) 

with nitrogen gas till it cools to ambient conditions to 
avoid leaching of the acid and degrading of the mem¬ 
brane performance [3]. 

Source of Fuels for Fuel Cell 

The current earthquake in Japan and Virginia, USA, is 
a reminder that the nuclear power-generating plants 
are very vulnerable to natural disasters. Nuclear plant 
(prior to the earthquake in Japan) has been projected 
to be the main source of hydrogen production, as it 
is cleaner and more efficient. The Chernobyl 
(1986) nuclear disaster and the Japanese (2011) disaster 
triggered by an earthquake of 9.0 magnitude on the 
Richter Scale, clearly drew a line between science, engi¬ 
neering, and nature. Despite the precautions and tech¬ 
nological advancement of Japan, it could not contain 
such magnitude of earthquake. Couple of months after 
the incident, the levels of radiation at the Japanese 
nuclear meltdown have been upgraded to level 7, 
same as the Chernobyl radiation after the disaster. 

The Virginia quake of summer 2011 rattled Washing¬ 
ton D.C., metropolis with a magnitude of 5.8 on the 
Richter Scale. This spontaneously shutdown the 
nuclear reactor in Virginia, raising serious concerns of 
possible nuclear radiation to the atmosphere. These 
recent incidents pre-suggest that another sustainable 
source of hydrogen production (and not nuclear 
power plant) is needed for the hydrogen economy. 

Following the above Japanese event, Japan, Germany, 


and Switzerland have frozen proliferation of building 
further new nuclear reactor plants. 

The question now is which is the most environmen¬ 
tally and sustainable way of producing hydrogen for the 
hydrogen economy? Current pathways of hydrogen pro¬ 
duction include: steam methane reformation (SMR), 
partial oxidation (POX) process, and autothermal refor¬ 
mation (ATR) process of fossil fuels. However, the pro¬ 
duction and availability of fossil fuels are projected to dip 
after 2020, and as such, a renewable source of hydrogen is 
required. Electrolysis of water is a viable source; however, 
energy is required to electrolyze water and compress it to 
350 bar (5,000 psi) or 700 bar (10, 000 psi) in order to 
store enough hydrogen to meet the same driving dis¬ 
tance as its gasoline equivalent and package the fuel tank 
into the available fuel storage space. First, an analysis of 
the five types of fuel cells will be presented highlighting 
their potentials for hybrid vehicle propulsion. 

In the final analysis, water electrolysis method to 
obtain hydrogen is a clean technology, but requires 
energy for the process and storage of the gas under 
pressure or possibly liquefaction of the fuel. The cur¬ 
rent technology of steam reformation relies on fossil 
fuels as feedstocks. Future energy carrier should be 
sustainable, clean, and renewable. Fuel cells for trans¬ 
portation in hybrid vehicles are feasible with the scal¬ 
able fuel cell categories, such as PEM, direct methanol 
fuel cell (DMFC), alkaline fuel cell (AFC), and to 
a limited extent, with the solid oxide fuel cell (SOFC) 
as an auxiliary power unit (APU). 
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Fuel Cell as a Power System 

Generally, fuel cells have the following advantages over 

other power-producing devices: 

1. Low to zero emissions, if a reformate or hydrogen is 
used as fuel, which makes it environmentally very 
friendly. 

2. Performance and efficiency are very high as there 
are no moving parts (excluding the system’s balance 
of plant, BOP). 

3. It is a bridge to hydrogen economy. The bridge may 
be in the form of electric vehicles (EV) or hybrid 
electric vehicles (HEV) - in which both fuel cell and 
a smaller internal combustion engine (ICE) are 
used for propulsion in the medium and low- 
power applications. 

4. Absence of transmission lines which decreases the 
complexity of the BOP. 

5. Flexible, scalable, and adaptable where remote 
power systems are required, for example, remote 
stations, booster stations, traffic and bacon lights, 
road signs, street lights, etc. 


6. For space missions. Fuel cells have long been used 
for manned space missions, for example, the Apollo 
Mission of the 1960s and the Shuttle Orbiter. 

In the following analysis, much attention is given to 
PEM because of its potential as the viable candidate 
fuel cell for use in transportation. It is noteworthy that 
not all types of fuel cells are suitable for use in the 
transportation industry because of the power density, 
type of electrolyte, size, and thermal cycling. 

Analysis of Various Fuel Cells 

The Proton Exchange Membrane (PEM) Fuel Cell The 
PEM fuel cell derives its name from the electrolyte 
used, which is a conducting membrane sandwiched 
between two catalyst layers, gas diffusion layers and 
plates. The schematic is shown in Fig. 1. A PEM fuel 
operates by inducting hydrogen into the anode, where 
it is oxidized into electrons and protons as per the 
equation: 

H 2 —> 2H + + 2e“ (1) 




Plate 

Gas Diffusion - 

Layer (GDL) 


Catalyst Layer 


Gas Diffusion 
Layer (GDL) 


Fuel Cell Powered HEV Design and Control. Figure 1 

PEM fuel cell components 
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The two electrons are not membrane permeable; hence, 
they travel through the anode electrical interconnect 
to the load, generating electricity. From there, the 
electrons travel to the cathode catalyst layer, where 
they react with the supplied oxygen and the protons 
that permeate through the membrane to the cathode 
catalyst layer to form water in a reduction reaction. The 
process is exothermic. 

At the cathode, the reaction is as follows: 

y 2 0 2 + 2e“ + 2H + -► H 2 0 + Heat (2) 

Overall reaction: 

H 2 + i/ 2 0 2 H 2 0 + heat (3) 



The schematics of a PEM fuel cell, a 50 cm 2 LT-PEM 
and HT-PEM fuel cell hardware are shown in Figs. 1, 2a 
and 2b respectively, while and multi-test station is 
shown in Fig. 3. 

Figure 3 features a SERC (Shartz Energy Research 
Center) four test station (TS) platform for testing and 
validation of a single cell to 1 kW stack. The two TS to 
the left of the computer monitor operate in a dead-end 
mode and are capable of testing single cells and low- 
end stacks rated up to 250 W. While TS 3 and 4 are 
rated up to 1 kWe and operate in a flow-through mode. 
Low-temperature PEM relies on humidification of the 
reactants and employs various types of humidifiers. For 
example, the 4-cell PEM stack on TS 2 has an-inbuilt 
membrane humidifier while TS 4 has an externally 
mounted humidifier shown in Fig. 4. That aside, PEM 
fuel cell stacks require a cooling system which also 
serves as the medium for maintaining the cell/stack 
temperature at a constant temperature. The test equip¬ 
ments are designed and sized for testing various output 
power ranges of cells/stacks. As the power output of the 
stack increases, the measuring devices ranges and pipe 
sizes are progressively increased to deliver the desired 
amount of reactants. Figures 5 and 6 show test equip¬ 
ment for low and high-temperature PEM fuel cell test 
equipments capable of testing and validating a 5 kW 
PEM stack. 

For the mobility sector, low-temperature PEM 
requires a complex balance of plant for its smooth 
operation. That aside, water from the exhaust is an 
issue of consideration during city driving in winter. 
It can contribute to slipperiness of the road and can 



Fuel Cell Powered HEV Design and Control. Figure 2 

(a) LT-PEM single cell of area 50 cm 2 assembled at the 
Center for Fuel Cell Research, Kettering University, 
Michigan (Courtesy Kettering PEM Fuel Cell Lab), (b) HT-PBI - 
PEM Single cell of area (Courtesy: Kettering PEM fuel cell 
research and powertrain integration, laboratory) 


be a hazard to other road users. On the other hand, the 
high-temperature fuel cells offer an advantage in that 
vapor exits from the exhaust and can be used for 
heating the vehicle in winter. However, both low- and 
high-temperature PEM require heated pads to keep the 
stacks above 10°C during winter season. Freezing of 
liquid within the stack’s flow field, in the gas diffusion 
layer and condensates in the exit flow manifold are all 
unacceptable. 

PEM membranes are categorized as low-temperature 
and high-temperature membranes. A review of both 
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Fuel Cell Powered HEV Design and Control. Figure 3 

Multi-test station (TS) for LT- and HT-PEM research 



Fuel Cell Powered HEV Design and Control. Figure 4 

Humidifier for a LT-PEM fuel cell at Kettering University Center for Fuel Cell Research 


types highlights the following advantages and disadvan¬ 
tages: Both are credited with good startability - they start 
to generate electricity shortly after start-up (LT-PEM 
from 60°C and HT-PEM at 130°C). Performances of 


PEM membranes are generally degraded by the presence 
of carbon monoxide. For LT-PEM, CO concentration 
above 10 ppm is not recommended, while HT-PEM 
based on a PBI can tolerate up to 3% CO in the fuel. 
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Fuel Cell Powered HEV Design and Control. Figure 5 

A 5 kW Greenlight low-temperature PEM test equipment at Kettering University Center for Fuel Cell Research 


Higher concentrations have been tested up to 8% at 
a much reduced voltage [3-6]. Operation around 
190°C favors higher CO concentration and the loss in 
performance is reversed if the cell is operated later with 
pure hydrogen [3]. 

Both LT and HT-PEM require thermal management 
systems - LT-PEM uses water as the coolant while 
HT-PEM uses mineral oil. However, LT-PEM in addi¬ 
tion relies on humidification of the membrane for 
proton transport. The functional humidification 
range is between 80% and 100% relative humidity 
(RH). Above 100% RH, the cell floods and impairs 
the flow of the reactants to the catalyst sites. As 
a consequence, the cell reaches its mass concentration 
limit, a point where the flow of reactants to the catalyst 
sites diminishes and the fuel cell voltage falls off. At 
a lower RH, the membrane dries up and impairs proton 
transfer from the anode to the cathode. This inade¬ 
quacy of the LT-PEM led to the development of a more 
tolerant membrane, capable of operating without 
humidification and its complexities. Its electro¬ 
chemical kinetics are enhanced because of its high- 
temperature operation; also, the activation overvoltage 
is reduced. It is more tolerant to a higher CO in the fuel 
composition. 


Future fuels require flexibilities to fuel contami¬ 
nants. Experience has shown that conventional fuels’ 
dispensing, storage, and transportation systems 
(trucks, pipelines, etc.) harbor huge deposits of foreign 
materials that degrade the quality of fuel with time. 
High-temperature membranes are more resilient and 
less sensitive to increased CO constituents in the 
fuel than sulphonated membranes and can tolerate 
reformates. This enables reformers to be coupled 
directly to the fuel cell which eliminates the complexity 
of post-reformation processing of the fuel to obtain 
hydrogen of 99.997% purity. 

Direct Methanol Fuel Cell (DMFC) The direct 
methanol fuel cell has the same architecture as 
hydrogen-air PEM fuel cell in that it uses sulphonated 
membranes. It is a low-temperature fuel cell that 
operates between 50°C and 80° C. Methanol is fed to 
the anode while air is fed to the cathode. The anode 
reaction proceeds as follows: 

CH 3 OH + H 2 0 -► +C0 2 + 6H + + 6e“ (4) 

At the cathode, the reactions proceed as: 

IV 2 O 2 + 6H+ + 6e“ -► 3H 2 0 (5) 
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Fuel Cell Powered HEV Design and Control. Figure 6 

A 5 kW high-temperature PEM fuel cell test stand at 
Kettering University Center for Fuel Cell Research 

The overall reaction is: 

2CH 3 OH + 30 2 -► 4H 2 0 + 2C0 2 (6) 

The reaction in DMFC is very slow in both electrodes 
despite its apparent advantages of releasing six 
electrons. To remedy this, a bi-metal catalyst Pt/Ru is 
used at the anode with a tenfold increase in catalyst 
loading compared with hydrogen-air PEM fuel cell 
[7-10]. The activation overvoltages in both electrodes 
of DMFC is considered, unlike hydrogen-air PEM 
where the anode activation overvoltage is ignored 
because it is by many orders of magnitude smaller 
than the cathodic activation overvoltage. DMFC is 
a candidate for portable devices such as cell phones, 
PDAs, soldier power, laptops, DVD players, iphones, 
ipads, ipods, kindles, and other portable devices. It 
works well in remote areas where power is required 
for a prolonged period such as remote sensing 
stations, railway crossings, traffic lights, bacon lights, 
professional cameras, etc. It is one candidate for light 


HEV application but the technology has not been fully 
developed for such applications yet. DMFC has a low 
power density (kW/L); hence, its role as a power source 
for transportation is diminished. 

Alkaline Fuel Cell The AFC uses liquid KOH 
electrolyte for its operation and it is popular for its 
past space missions in Apollo Spacecraft in the 1960s 
and later in the Space Shuttle Orbiter. Due to the 
renewed interest in the development of solid polymer 
membranes in the early 1990s, less attention has been 
given to AFC. The advantage of AFC is overwhelming. 
The electrolyte is cheap and has a proven technology 
for the production of KOH. However, the electrolyte is 
sensitive to the presence of atmospheric C0 2 in the air 
which combines with the potassium to form an 
insoluble K 2 C0 3 that renders the electrolyte insoluble, 
hence degrading its performance. AFCs are installed in 
utility vehicles which operate within a restricted area 
where pollution by automobiles is restricted. It can be 
operated as a mobile and stationary power source. 


The basic anode reaction 


2H 2 + 40H“ -► 4H 2 0 + 4e" 

(7) 

At the cathode 


2H 2 0 + 0 2 + 4e _ 40H“ 

(8) 


Phosphoric Acid Fuel Cell (PAFC) Phosphoric acid 
fuel cell PAFC derives its name from the type of 
electrolyte used. It is a medium temperature fuel cell 
used for stationary power generation. The operational 
temperature is in the 200-220° C range. It is one of the 
oldest and matured fuel cells with vast production in 
the ranges of 200 kW. However, more powerful units 
ranging in MW have been introduced into service. It 
finds wide applications as back-up power units in 
credit card companies’ headquarters, police precincts, 
hospitals, banks, airports, etc. The USA and Japan are 
the two leading countries in the production and 
application of PAFCs. Phosphoric acid fuel cell is 
equally susceptible to CO poisoning as hydrogen-air 
fuel cell, but can tolerate up to 0.5% or 5,000 ppm of 
CO [4-7]. 

Molten Carbonate Fuel Cell (MCFC) Molten carbonate 
fuel cell is a stationary power generation system that 
operates around 650° C. It has the advantage of not using 
noble metals at catalyst as a result of high operational 
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temperature. It can use multiple fuels, such as methane 
(CH 4 ), CO, and hydrogen, as fuel. One advantage of 
MCFC as well as SOFC is the use of exit gases for 
cogeneration. That implies using waste heat to operate 
either steam or gas turbine, thereby bottoming the cycle 
and increasing the efficiency of the utilized fuel in the 
entire system. At the exit temperature of 600-650° C, 
the waste heat is rerouted to the boiler to generate 
steam to operate a steam turbine. The turbine in turn 
is connected to a generator or it can be geared to operate 
several devices of the balance of plant. The final waste 
heat at a lower enthalpy can be used for heating a facility. 
This is achieved through routing the city water and the 
waste heat though a heat exchanger, thereby reducing or 
eliminating the extra cost of maintaining the facility 
warm water system. 

The common electrolytes used in MCFC are the 
carbonate salts of sodium, lithium, and potassium. 
The mobile ion is the CO3 -2 that moves from the 
cathode to the anode [7-10]. 

The anode reaction using hydrogen as fuel is: 


2H 2 + 2C0 3 2_ -> 2H 2 0 + 2C0 2 + 4e“ (9) 

The cathode reaction is: 

0 2 T 2C0 2 T 4e —> 2C0 3 2 (10) 

The reaction using CO as fuel is as follows: 

At the anode, it is: 

2CO + 2C0 3 2- -> 4C0 2 + 4e“ (11) 

The cathode reaction is: 

0 2 + 2C0 2 + 4e“ -► 2C0 3 2- (12) 


Molten carbonate fuel cell finds a wide application as 
a stationary power utility plant and is not suitable for 
hybrid electric vehicles purposes. 

The Solid Oxide Fuel Cell (SOFC) The solid oxide fuel 
cell can be used both as a stationary and auxiliary (for 
mobile) power unit (APU) depending on its application. 
Current research favors using it as an auxiliary power 
unit for inter-state trucks, recreational vehicles, auxiliary 
power unit, and recharging unit for soldier power, 
power tools, etc. SOFC uses yttrium-stabilized zinc 
oxide as the electrolyte [7-10]. The operating 
temperature is between 650° C and 1,000°C. Due to its 
high operating temperature, SOFC as of now is not 
considered a candidate for hybrid electric vehicle 


applications, but probably for future locomotive or 
metro-rail transportation systems. 

The reaction at the anode using hydrogen as fuel is: 

2H 2 + 20= -> 2H 2 0 + 4e“ (13) 

The cathode reaction is: 

0 2 + 4e _ —> 20 = (14) 

The reaction using CO as fuel. 

At the anode, the reaction is: 

2CO + 20 = 2C0 2 + 4e“ (15) 

The cathode reaction is: 

0 2 + 4e _ —> 20 = (16) 

Candidate Fuel Cells for Hybridization 

The above overview shows three fuel cells as candidates 
for HEV: PEM, DMFC, and AFC in a descending order. 
Out of these three, AFC is least considered for auto¬ 
motive applications for the following obvious reasons: 

(1) The electrolyte is easily degraded in the presence of 
atmospheric C0 2 . When potassium carbonate is 
formed, less OH - ions are available for ionization, 
and as such, the performance of the cell is degraded. 

(2) The fuel cell lacks flexibility in all possible orienta¬ 
tions. The advantages are overwhelming: (1) The elec¬ 
trolyte is cheap and easily replaceable. (2) The 
technology for KOH production is available which 
makes the fuel source to be sustainable. (3) The absence 
of bipolar plates reduces the cost and technological 
process of AFC production [4]. 

Solid oxide fuel cell SOFC on the other hand can be 
considered in an auxiliary capacity to support recrea¬ 
tion vehicles (RV), large trucks as an APU, but not as 
the main power unit. The disadvantages are: It requires 
some hours to bring it up to the operating temperature 
and shutdown - thermal cycling, and it operates at very 
high temperatures of 650-1,000°C and, as such, is not 
suitable for HEV application. 

Hybrid Technology Using Gasoline or Diesel, Fuel 
Cells, Ultra-Capacitors, and Batteries 

Hybrid Using Gasoline- or Diesel-Powered Engines 

The technology of gasoline and diesel (internal com¬ 
bustion engine) operation is very well understood; 
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hence, little time will be devoted here for its explana¬ 
tion. Internal combustion engines (ICE) are devices 
that convert the chemical energy of the fuel (gasoline 
or diesel) to mechanical work through the piston, 


connecting rod, and crankshaft mechanisms. The 
architecture of the gasoline or diesel hybrid consists of 
an ICE/generator, electric motor, battery, drivetrain, 
and wheels. The schematic is shown below in Fig. 7 
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Hybrid ICE 
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in which ICE/generator for oversimplification is 
represented as ICE. The mechanical power from ICE 
is converted into electrical power through the electric 
generator connected on the same shaft with the engine. 
Electric power is transmitted to the motor. Turbo¬ 
charged ICE has an immense advantage over its natu¬ 
rally aspirated versions. The exhaust gas energy from 
the turbine is harnessed by coupling an electric motor 
generator to the same shaft as the turbo-compressor 
taking advantage of the exhaust gas energy to generate 
extra electricity. During acceleration when the vehicle 
requires more torque/air, the electrical motor is acti¬ 
vated with the extra energy from the turbo-generator. 
When there is excess exhaust energy, that portion 
would be harnessed to recharge the batteries, thereby 
bottoming the cycles of the exhaust gas use in hybrid 
vehicle [11]. The architecture in Fig. 7 shows various 
modes of ICE hybrid propulsion. Figure 7a presents 
idle operation where the generated power is used 
to charge the battery while (b) depicts the normal 
operation of a conventional ICE. The energy path 
is indicated with the arrows. Under normal opera¬ 
tion, the power from the engine is used both for pro¬ 
pulsion and charging the battery as shown in (c) while 
additional energy is needed during acceleration and 
transients from the second power source (battery) 
shown in (d). As an extender, the ICE is turned off 
during traffic in city driving and the energy from the 
battery is used for propulsion (e). The concept of 
regenerative braking involves capturing the kinetic 
energy during braking and deceleration to charge the 
batteries (f). 

Fuel Cell Hybrid Architecture 

The fuel cell hybrid vehicle architecture incorporates 
PEM fuel cell, onboard compressed hydrogen tank, and 
power conditioner, batteries, ultra-capacitors, motor 
and control electronics. A schematic of the PEMFC- 
powered hybrid vehicle using heavy-duty batteries and 
ultra-capacitors is presented in Fig. 8. The PEMFC 
balance of plant includes compressed hydrogen in 
a storage tank at a pressure of 350 bar (5,000 psi) or 
700 bar (10,000 psi); cooling system; humidification 
system; compressed air system (fan, etc.); and an elec¬ 
tronic display panel for controlling the flows into the 
fuel cell. The steps involved in integrating the entire 



Fuel Cell Powered HEV Design and Control. Figure 8 

Fuel cell hybrid architecture 


system include sizing the fuel cell to provide average 
power over the entire operating range [ 12, 13] . The fuel 
cell plant has input and output data logging inlets and 
outlets to monitor the reactants’ flow rates, the voltage, 
current, power, power density, and stoichiometries of 
reactants. A DC-DC converter is required between the 
FC and the load. Sets of batteries and ultra-capacitors 
are also required [ 12] . Charging of the batteries is done 
when the vehicle idles, or prior to using the vehicle. The 
DC-DC converter provides supplementary power for 
the ultra-capacitor’s recharging functions while the 
ultra-capacitors provide power needs during tran¬ 
sients, such as peak loads and transients during accel¬ 
eration in fuel cell-powered vehicles. Some of the 
roadblocks in quick implementation of FC-HBVs are 
hydrogen infrastructure, sub-zero temperature opera¬ 
tion, cold start constraints in the colder regions of the 
country and issues of dealing with fuel cell exhaust 
water during winter months on the road during city 
driving. 

Ultra-Capacitors 

The factor limiting the range and effectiveness of 
a rechargeable-battery-powered electric vehicle is the 
battery (or the energy storage system). Ultra-capacitors 
represent another generation of energy storage devices 
and are electrochemical double layer capacitors that 
supply peak energy requirement during transients, 
such as acceleration and cold start in fuel cell-powered 
vehicles and other hybrid electric vehicles (HEV). 
Figure 9 shows Maxwell’s 2,600 F ultra-capacitors. 
Ultra-capacitors are widely accepted in fuel cell vehicles 
and other HEVs for their quick charge/discharge func¬ 
tions. Some of the advantages are their flexibilities in 
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Fuel Cell Powered HEV Design and Control. Figure 9 

Maxwell ultra-capacitors 


charging/recharging. They cannot be overcharged and 
can be maintained at any voltage above/below the rated 
value. While rechargeable batteries generally exhibit 
a very slow charge/response functions with signifi¬ 
cantly reduced life cycles, ultra-capacitors work best 
in this condition by recuperating within a few seconds 
as a result of their low equivalent series resistance 
(ESR). They possess a high cycle-life with less deterio¬ 
ration when used frequently and infrequently. They 
respond within a wide temperature range of 65-40°C. 
They exhibit high columbic efficiency which implies 
total charge removed over the replenished charge 
[13-15]. 

Conclusion 

In conclusion, both low- and high-temperature proton 
exchange membrane fuel cells will be used in 
fuel cell hybrid vehicles for transportation as they pro¬ 
vide overwhelming advantages over the internal com¬ 
bustion engines, such as zero emissions when hydrogen 
is used as fuel, reduced noise, and higher efficiency. 

The use of integrated technologies such as fuel cell, 
batteries, ultra-capacitors, and other novel techniques 
will occupy the center stage in fuel cell vehicle devel¬ 
opment within this decade. 
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Glossary 

AFC Alkaline fuel cell. 

EC Electrocatalysis. 

FC Fuel cell. 

GDE Gas diffusion electrode. 

GDL Gas diffusion layer. 

HOR Hydrogen oxidation reaction. 

HT-PEFC High temperature polymer electrolyte fuel 
cell. 

MCFC Molten carbonate fuel cell. 

ORR Oxygen reduction reaction. 


PAFC Phosphoric acid fuel cell. 

PEFC Polymer electrolyte fuel cell. 

SOFC Solid oxide fuel cell. 

Definition of the Subject 

Fuel cells are efficient energy converters, based on 
electrochemical principles. They convert the chemical 
energy (heating value) of a fuel directly into electricity, 
circumventing the various steps of thermal conversion 
and electricity generation. Fuel cells can be designed 
and constructed on the basis of a multitude of mate¬ 
rial combinations for electrolyte and electrodes, open¬ 
ing the choice of different fuels. The electrocatalytic 
reactions of fuel and oxygen are major challenges to 
obtain high conversion efficiency. The electrochemical 
basics of different fuel cell types considered today 
for technical applications are described in this 
contribution. 

Introduction 

Fuel cell technologies have seen a revival in recent years, 
due to several reasons. Global warming and local air 
pollution caused by various energy utilization pro¬ 
cesses have created a multitude of environmental 
concerns, promoting the development of novel tech¬ 
nologies with high conversion efficiencies and low 
emissions, possibly zero emission, with respect to 
green house gases and other pollutants [1, 2]. Peak oil 
is another reason for the renewed interest in fuel cell 
technologies, in particular for automotive applications 
[3, 4]. Although this fact is discussed in a highly con¬ 
troversial manner, limitation in crude oil supply is 
obvious in the long-term perspective. This particular 
aspect of fossil fuel resources is strongly interlinked to 
the future perspective of the “oil price” and, hence, its 
economic competitiveness to other fuels, e.g., fuels 
from renewable sources. In this context, the installation 
of new supply infrastructures for alternative fuels, 
e.g., H 2 , is an important additional economical and 
political factor. Dedicated analysis has clearly shown 
that energy conversion in fuel cells has to be based on 
fuels, in particular hydrogen, derived from renewable 
sources [5]. Further, the geographical distribution of 
oil reserves causes concerns about the supply security 
in industrial centers around the world. 
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Overall, there exist several reasons to ask for novel 
efficient conversion technologies for mobility 
(electromobility) and combined heat and power sys¬ 
tems (CHP) with independence on fossil fuels, in par¬ 
ticular crude oil. Another area of interest in fuel cell 
technology is portable electric and electronic applica¬ 
tions, where the argument of potentially higher energy 
density as compared to today’s available battery tech¬ 
nologies, hence, longer time of operation, is of prime 
interest [6]. 

Electricity generation via fuel cells is an idea, based 
on fundamental research carried out by Christian Frie¬ 
drich Schonbein at the University of Basel in the early 
nineteenth century. Schonbein’s basic investigations of 
the respective reaction of H 2 gas (oxidation) and 0 2 gas 
(reduction) dissolved in the electrolyte at an electrode/ 
electrolyte interface, representing the fundamental fuel 
cell reactions, led William Robert Grove to the design 
and construction of the first operating fuel cell. He 
utilized H 2 as fuel and 0 2 as oxidant at platinum 
metal electrodes in two separated closed compart¬ 
ments, filled with the respective reactant (hydrogen 
electrode, oxygen electrode), and joined by sulfuric 
acid as common electrolyte [7]. 

As will be shown in this chapter, the basic electro¬ 
chemistry of a fuel cell, an electrochemical gas battery, 
is determined by the choice of the electrolyte and the 
two different electrodes immersed therein. In the 
course of time since this fundamental work of 
Schonbein and Grove, many combinations of electro¬ 
lytes and electrode materials have been tested and fuel 
cell types derived upon there. This development has 
also been driven in the past by the choice of available 
fuel and possible application in view, in particular by 
space and military applications. 

This introductory chapter will give an overview on 
the basic electrochemistry of some fuel cell types, devel¬ 
oped today for dedicated technological applications. 
The respective electrochemistry will depend on the 
materials composition of the fuel cell, in particular, 
on the nature of the electrolyte and the temperature 
of operation. This further determines the choice of fuel, 
due to the strong influence of temperature on the 
mechanism of the respective electrocatalytic processes. 

Engineering issues of fuel cell stack and systems 
design will be dealt with in the following chapters. 


Hence, issues of, e.g., coflow or counterflow within 
one cell, stoichiometry and utilization of fuel and oxi¬ 
dant, temperature and current distribution in a fuel cell 
of technical scale, and, certainly, issues of stacking cells 
into a bipolar arrangement will not be discussed here. 

What Is a Fuel Cell? 

A fuel cell is an electrochemical device, converting the 
chemical energy (Gibbs free energy) stored in a gaseous 
or liquid fuel, e.g., hydrogen, methane, methanol, 
ethanol, others, directly into work of electrical energy 
(direct current electricity) at constant temperature 
(Fig. 1). This type of energy conversion process is 
different from the classical thermomechanical energy 
conversion process and is not limited by the Carnot 
principle (see below). 

In short, in a fuel cell, the fuel is oxidized at an 
electrochemical interface (electrode called anode), 
accepting electrons and donating these electrons at 
a second electrochemical interface (electrode called 
cathode, separated from the anode) to an oxidant, 
e.g., oxygen, which is reduced by accepting these elec¬ 
trons. Both electrochemical interfaces have to belong to 
a common electrochemical cell and are joined in the 
cell by a common medium, an ion-conducting electro¬ 
lyte. Both electrodes have to be connected electronically 
by an external circuit, containing the electrical device to 
be operated, in which the electrons, due to the potential 
difference created by the two electrode reactions, travel 
from the anode to the cathode delivering electrical 
work (Fig. 2). Fuel and oxidant are supplied in gas 
channels of the cell housing (bipolar plate in stacked 
cells) on the backside of the porous electrodes (not 


chemical _^ heat _^mechanical _^electrical 

energy energy energy 

A 

- FUEL CELL - 


Fuel Cell Types and Their Electrochemistry. Figure 1 

Scheme of direct energy conversion in an electrochemical 
cell as compared to "conventional" thermal conversion 
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(Cathode) (Anode) 


ANODE : H 2 ^2H + + 26- 
CATHODE : 2H + + 2e" + 1/2 0 2 -> H 2 0 

Fuel Cell Types and Their Electrochemistry. Figure 2 

Simplified fuel cell scheme with an acidic aqueous 
electrolyte, e.g., the polymer electrolyte fuel cell (PEFC). 
Fuel: H 2 , oxidant: 0 2 . Only porous gas diffusion electrodes 
and electrolyte are shown, cell housing is not shown [8] 

displayed in Fig. 2) Both gases have to be transported 
through the porous gas diffusion layers (GDLs) with 
pores typically in the micrometer range (blue and red 
bodies in Fig. 2) to the electroactive catalyst layers 
(ECLs, black dots in Fig. 2) at the interface to the 
electrolyte. Colloquially, GDL and ECL together are 
called gas diffusion electrode (GDE). 

The fuel cell and its electrochemically active com¬ 
ponents, i.e., electrodes, electrolyte, etc., as well as its 
(electrochemically inert) structure materials, i.e., cur¬ 
rent collectors, cell housing, etc., should be as invariant 
as possible, i.e., they should not be consumed and, 
ideally, not age (corrode) over the time of operation. 
Hence, as an electrochemical reactor, they provide the 
electrochemically active interfaces (or interphases, see 
below) and the necessary pathways for mass transport 
for educts and products to and from these active inter¬ 
faces through porous media (active electrode layers, gas 
diffusion layers, internally corrugated cell housing 
(flow fields) in bipolar plates) with open porosity at 


different scales. At the same time, it is a prerequisite 
that these materials are as conductive as possible 
because they are responsible for the collection and 
transmission of the electric current generated at the 
two interfaces. Hence, ohmic voltage losses in these 
materials should be as low as possible. 

The electrochemistry of the two electrode reactions 
is exemplified for the simplest and predominant case by 
the “cold” electrochemical combustion of H 2 with 0 2 
(pure 0 2 or from ambient air) to H 2 0 (Fig. 2). The 
overall reaction is split into two partial reactions, 
occurring at the two different electrodes of the cell: 

Anodic reaction : H 2 = 2H + 2e _ 

(hydrogen oxidation reaction, HOR) 

Cathodic reaction : l/20 2 + 2H + + 2e“ = H 2 0 

(oxygen reduction reaction, ORR) 

Overall reaction : H 2 + 1 / 2 0 2 = H 2 0 

The reaction product(s) of the individual electrode 
reactions are constituents of the (aqueous) electrolyte 
(H + in the example of an acidic electrolyte, as displayed 
in the scheme of Fig. 2), which avoids polarization 
losses in the electrolyte (see Chapter “► Livestock 
Somatic Cell Nuclear Transfer”). 

Each of the two electrode reactions creates 
a characteristic potential difference across the interface 
solid electrode/electrolyte, which is different for the 
two reactions according to the different reactants. The 
overall cell voltage between the two electrodes, which 
are joined by the same electrolyte, allows the electrons 
generated at the anode (HOR) and consumed at the 
cathode (ORR) to create work in the external circuit. 
Hence, chemical energy released by the individual elec¬ 
trode reactions at the locally separated electrodes is 
directly transferred into electrical energy. This pathway 
is different from the combustion step in the “classical” 
thermomechanical power generation, where the oxida¬ 
tion of fuel and reduction of oxidant occur in the same 
volume element, thereby generating heat only. 

Generally, the available cell voltage of electrochem¬ 
ical cells depends on the thermodynamics of the two 
electrode reactions in the prevailing electrolyte, hence 
the difference in the electrode potentials, and is con¬ 
fined, according to the series of electrochemical poten¬ 
tials, to a few volts [9]. According to the individual 
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electrode potentials of the H 2 /H + reaction (by IUPAC 
standard zero volt in the series of electrochemical 
potentials, acidic electrolyte, standard conditions of 
1 atm and 25 °C or 298K) and the 0 2 /H 2 0 reaction 
(1.23 V, respectively), a cell with H 2 and 0 2 as reactants 
should yield an ideal cell voltage of 1.23 V at these 
standard conditions. In practice, a lower value in the 
range of 1 V is observed, due to different implications 
(side reactions, depolarization of electrodes due to 
crossover of gases through the electrolyte, etc.). 

The free energy AG (Gibbs energy at constant pres¬ 
sure) of the fuel cell reaction is related to the cell voltage 
under open circuit conditions (open circuit voltage, 
OCV) E° and standard conditions according to 

E° = -AG/nF 

n being the number of electrons transferred in 
the respective reaction (equal to 2 for the reaction 
H 2 + 1 /20 2 = H 2 0) and F the Faraday constant equal 
to 96,487 C/mol (electric charge per mol of electrons). 

Further, AG can be expressed as the difference of the 
free enthalpy AH of the reaction (thermal cell voltage 
E h ° equivalent to 1.48 V for the H 2 /0 2 reaction under 
standard state conditions) and the term TAS, (T abso¬ 
lute temperature) expressing entropy losses (or gains) 
AS in the reaction: 



Fuel Cell Types and Their Electrochemistry. Figure 3 

Voltage (thermal) efficiency e t h for the electrochemical 
reaction of H 2/ CO, and CH 4 as fuel with oxygen from air 
under equilibrium versus temperature (°C). In comparison, 
Carnot efficiency of a thermal process as function of the 
respective higher temperature in °C and with 0 °C as lower 
temperature is displayed 


AG = AH - TAS 

Hence, the voltage (thermal) efficiency of a reaction 
under equilibrium conditions can be written as: 

_ AG _ TAS 

Sth ~ AH ~ 1 “ AH 

The voltage (thermal) efficiency for the electro¬ 
chemical conversion of the three different fuels H 2 , 
CH 4 , and CO reacting with air (oxygen) in dependence 
of reaction temperature is displayed in Fig. 3. In com¬ 
parison, also the maximal efficiency r| = (T x —'T 2 )/T x for 
a Carnot process at the respective higher temperature 
Ti and the lower temperature T 2 equal to 0 C is 
included. The different temperature dependence of 
the three conversion reactions is due to the respective 
entropy changes, e.g., AS = 0 for the reaction 
CH 4 + 20 2 = C0 2 + 2H 2 0 and AS ^ 0 for the reactions 
2H 2 + 0 2 = 2H 2 0 and 2CO + 0 2 = 2C0 2 . 

Based on thermodynamic arguments only, electro¬ 
chemical energy conversion offers higher efficiencies 


within a certain temperature window, depending on 
the respective reaction. 

Introducing Nernst’s law for the equilibrium case, 
the situation when no current (and hence power) is 
delivered by the cell, the equilibrium cell voltage under 
nonstandard conditions for a H 2 /0 2 cell in dependence 
of the respective reactant/product concentration (par¬ 
tial pressures) can be expressed as: 


E eq = — = E ec1,c — F eq,a 

n t F 


o RT, 

= E° H-In 

IF 


Ah 2 ][o 2 ] 1/2 \ 

V [Hi O] ) 


E eq = equilibrium cell potential, E eq,c = equilibrium 
potential cathode, E eq,a equilibrium potential anode, 
R gas constant, T absolute temperature, E° = equilib¬ 
rium potential under standard state. 

E eq is also called “theoretical” open circuit cell volt¬ 
age, OCV, in dependence of temperature and reactant, 
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respectively product concentrations. Practical OCVs 
may observed to be lower, due to losses (mixed poten¬ 
tials due to side reactions or gas crossover, see practical 
OCV of H 2 /0 2 fuel cell) already occurring at zero 
current flow. 

Under current flow, the kinetics of the electrode 
reactions comes into effect. The Butler-Volmer equa¬ 
tion describes the dependency of the current passing 
through the electrode interface (current density per 
unit geometric area) on a small voltage excursion 
(called overvoltage r|) from the respective equilibrium 
potential E eq . 

i = i 0 [exp(anF? 7 /RT) — exp((l — a)nF? 7 /RT)] 

i = current density at overvoltage r|, q = overvoltage 
(excursion from the equilibrium potential, i 0 = 
exchange current density (at r\ = 0), a = symmetry 
factor, representing the fraction of overvoltage assisting 
the reaction, n = number of electrons exchanged, 
F = Faraday constant, R = gas constant, T = absolute 
temperature. 

The exponential dependency points out that small 
potential excursions cause a larger change in current, 
while at larger potential excursions, the current follows 
a linear behavior. This can be described by the Tafel law, 
which usually is applied when current densities of tech¬ 
nical interest, e.g., several 100 mA per cm 2 geometric 
surface area in a fuel cell, are passing through the cell. 

Tj = (RT/anF) In i/i 0 

For the HOR, exhibiting an exchange current den¬ 
sity in the range of i 0 = 10 -3 A/cm 2 with platinum as 
electrode material [9], only a small overvoltage arises 
even by drawing a current density of technical interest, 
e.g., 1 A/cm 2 , due to the fact that the H 2 molecule is 
easily split into two adsorbed H atoms (Tafel reaction), 
followed by the transfer of one electron per H atom 
(Volmer reaction) to the electrode generating two 
hydrated H + . Alternatively, one electron is transferred 
to an adsorbed H 2 molecule, followed by a split into a 
hydrated H + and an adsorbed H atom. In a consecutive 
step, the second H atom is oxidized (Heyrovsky 
reaction). 

In contrast, the exchange current density of 
the ORR at a platinum electrode is in the range of 
i 0 = 10 -8 A/cm 2 . Assuming direct reduction of the 
oxygen molecule, the ORR requires four electrons in 


total, including the four H + generated in the HOR at 
the counterelectrode (see above). As a consequence, 
ORR generates a major overvoltage and, hence a 
major contribution to the overall loss. 

In addition to the overvoltage losses due to elec¬ 
trode kinetics, one has to take into account the voltage 
loss in the ion-conducting electrolyte, due to the finite 
electrolyte conductivity, and (minimized) losses due to 
the electric resistance of electrode and cell materials, 
including contact resistances. 

This relationship between cell voltage and current 
density is schematically displayed in Fig. 4. The various 
voltage losses are deducted from the equilibrium cell 
voltage (zero current), yielding the available cell voltage 
E* at the respective current density i*. The difference 
between E° H and E* is expressed as (waste) heat. In 
terms of voltage efficiency, this (heat) loss should be as 
low as possible; hence, E* should be as high as possible. 
On the other side, the current density to be drawn has 
to be as high as possible to utilize power output values 
(specific power W/kg or power density W/F) of tech¬ 
nical interest. 

Further, Fig. 4 points to another advantage of (elec¬ 
trochemical) fuel cell technology: Designing a fuel cell 
to the maximum or rated power for a certain applica¬ 
tion (power = cell voltage E times current i), one easily 
realizes that the voltage efficiency s e ff increases at lower 
power (lower current but higher voltage). This fact is 



Fuel Cell Types and Their Electrochemistry. Figure 4 

Cell voltage E versus current / of a fuel cell, with E H ° = 
thermal cell voltage, E G ° = ideal cell voltage based on AG, 
q = sum of overvoltage at anode and cathode, REiectroiyte = 
electrolyte resistance 
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particularly important for applications where maxi¬ 
mum power (full load) is only required at certain 
parts of the duty cycle, while in other parts, only part 
load is necessary. This case is, e.g., exhibited in auto¬ 
motive applications of fuel cell technology. 

Generally, the available cell voltage of electrochem¬ 
ical cells depends on the thermodynamics of the two 
electrode reactions in the prevailing electrolyte, hence 
the difference in the electrode potentials, and is con¬ 
fined, according to the Electrochemical Series of Stan¬ 
dard Potentials, to a few volts. Cells with an aqueous 
electrolyte exhibit a limitation given by the stability 
window of water, namely 1.23 V at standard conditions. 
As stated above, the H 2 /0 2 fuel cell allows practical 
open circuit voltages of around 1.0 V. At cell voltages 
above 1.23 V, typically around 1.5 V, decomposition of 
water into H 2 and 0 2 occurs. 

Hence, to accumulate the necessary voltage for 
technical applications, e.g., 200-400 V, for an electrical 
power train in a car, cells must be connected in series. 
Dedicated bipolar arrangements of cells have been 
designed and put into operation for serial connection, 
taking into consideration also the necessary parallel 
mass flow of fuel and oxidant from a manifold into 
each individual cell and the respective removal of the 
product. Such an arrangement of cells is called a fuel 
cell stack, combining the electrical serial connection 
of individual cells with a parallel connection for 
mass flow. 

In contrast to batteries, fuel cells are open systems, 
which convert the chemical energy available in a fuel 


stored outside the fuel cell, the electrochemical con¬ 
verter. As a consequence, fuel cells need a fuel tank, also 
a tank for the oxidant, if the oxidant is pure oxygen and 
not ambient air, and auxiliaries (for temperature, pres¬ 
sure, etc., control) to be operated (Fig. 5). 

Mass flow into (reactants) and out of the cell (prod¬ 
ucts) must be allowed and controlled, as well as cooling 
to remove the waste heat. At a voltage efficiency of 50%, 
a cell producing 1 kW electric power also generates 
1 kW of heat, which has to be removed efficiently not 
to cause thermal runaway and, hence, degradation of 
the cell and its components. In summary, this is called 
a fuel cell system. Engineering issues of mass and heat 
transfer, current distribution across the active area of 
a cell, and voltage distribution along the stack, etc., are 
governing the design of a fuel cell system. 

Fuel cells can be operated continuously, as long as 
they are supplied with fuel and oxidant. Hence, they are 
not limited by electrical charge-discharge cycles, as 
batteries are. This fact can be of utmost importance 
for certain applications, e.g., with respect to refueling 
time in mobile applications. For mobile or portable 
applications, the availability of a fuel infrastructure 
adds another aspect of consideration. 

In terms of achievable specific energy (W/kg) 
and volumetric energy density (W/F), air-fed fuel cells 
have the advantage that one reaction partner, namely, 
oxygen, is derived from air, and consequently does 
not add to the weight and volume of the device. 
However, ambient air must be processed (cleaned, 
humidified, etc.), which has to be taken into account. 



Surplus air 


Fuel cell stack 
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Fuel Cell Types and Their Electrochemistry. Figure 5 

Simplified scheme of a fuel cell system (Adapted from L. Gubler, Paul Scherrer Institut) 
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This advantage is also expressed in the concept of metal 
air batteries, e.g., the Zn/Air or Al/Air battery, which on 
the anode (metal) consists as half a battery (closed mass 
flow) and on the cathode as half a fuel cell (open 
mass flow). 

Next to the multitude of material combinations 
allowing various fuel cell types to be designed and 
built, the variety of mobile (transport), stationary, 
and portable applications provokes a multitude of dif¬ 
ferent cell, stack, and systems designs, guided by the 
implications of the different fuel cell technologies (see 
below). 

The Role of the Electrolyte 

Generally speaking, the ion conduction in the electro¬ 
lyte contributes to the losses (ohmic losses, see Fig. 3) 
in the cell voltage. Hence, one of the major tasks in fuel 
cell development is the reduction of these losses in the 
ionic circuit part, because the work available in the 
external electronic circuit should be as high as possible. 


The role of the “electrolyte” in a fuel cell is 

a multiple one: 

1. Transport the charge in ionic form from one inter¬ 
face to the other within the cell. To minimize losses 
in the ionic circuit, the ion conduction should be 
carried by an ion, which is produced at one elec¬ 
trode and consumed at the other to avoid losses 
caused by concentration gradients [10]. This 
majority ion should carry as much charge as possi¬ 
ble through the electrolyte, hence its transport 
number (Hittorf number) should be as high as 
possible, ideally one. For high power applications 
with an aqueous electrolyte, the concentration of 
this respective anion or cation must be high and 
exclusively responsible for conduction. Typically, 
a specific ionic conductivity in the range of 
100 mS/cm is required, which at a current density 
of 1 A/cm 2 and an electrolyte gap of 100 pm would 
yield an ohmic voltage loss in the range of 100 mV. 
As seen in Fig. 6, the conductivities of various 
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Fuel Cell Types and Their Electrochemistry. Figure 6 

Specific conductivity versus temperature of different electrolytes interesting for fuel cell applications (Adapted from 
K.D. Kreuer) 
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electrolytes are strongly temperature dependent 
and cover a wide temperature range, depending 
on the ion-conducting species in its respective envi¬ 
ronment (material). Again, electrolytes are of inter¬ 
est, which offer the opportunity that the ionic 
species is participating in the fuel cell reaction. 

2. Act as separator to (1) avoid crossover and, as 
a consequence, “hot” combustion of the fuel and 
the oxidant at the respective counterelectrode, and 
(2) to avoid touching of the electrodes by short 
circuiting the electronic pathway. Thereby, the elec¬ 
trolyte/separator gap should be as narrow as possi¬ 
ble to lower the ohmic contribution to the overall 
voltage loss. In the case of a liquid electrolyte, it can 
be absorbed into an inert matrix, providing poros¬ 
ity high enough for an ionic conduction path with 
low tortuosity for the ion and at the same time 
a high enough bubble pressure to suppress gas 
crossover. An ideal concept is the one of a solid 
electrolyte, which fulfills the dual electrolyte and 
separator function at the same time, providing 
a transfer number of one for the current-carrying 
ionic species, as in the case of solid polymer elec¬ 
trolytes (ion exchange membranes), thereby 
excluding a contribution of electronic conduction. 

One has to emphasize that the electrolyte has 
to sustain the potential window of the respective 
fuel cell reaction, given by the Gibbs free energy, at 
least over the device lifetime, specified for a certain 
application. 

In aqueous acidic electrolytes, the ionic conduction 
is provided by an H + ion, respectively, H(H 2 0) n + . 
As mentioned above, H + is created by the anodic oxi¬ 
dation of H 2 as a fuel, and the conducting species is 
transported to the cathode, where it is consumed in the 
direct (ideally four electrons) cathodic reduction of 
molecular oxygen. Hereby, water as the reaction prod¬ 
uct appears at the cathode side of the cell. 

In alkaline solution, the ionic conduction is pro¬ 
vided by OH - ions, created by the ORR at the cathode. 
Principally, OH - ions are carriers for the O 2- ions, 
produced as intermediate by the cathodic reduction 
reaction of molecular 0 2 and the follow-up reaction 
with a water molecule. After conduction, OH - reacts 
with the H + created at the anode and yields water as 
a product at the anode side. 


Solid polymer electrolytes on the basis of a poly¬ 
mer cation exchange membrane in H + form or an 
anion exchange polymer membrane in OH - form can 
be considered as quasi-aqueous electrolytes, whereby 
the water is absorbed in the phase separated ionic 
nanomorphology of the respective material. This 
nanomorphology forms ionic pathways through the 
polymeric membrane connecting the two fuel cell 
electrodes. 

In cells operated at temperatures below 100 °C, 
liquid water will be the reaction product. For operation 
temperatures above the boiling point of water, a cell 
with an aqueous or quasi-aqueous electrolyte can be 
operated, however, at the expense of pressurizing it to 
avoid loss of the water and, as a consequence, concen¬ 
tration and conductivity changes in the electrolyte. 

An alternative electrolyte for operating tempera¬ 
tures above 100 °C, which also gained technical rele¬ 
vance, is orthophosphoric acid, H 3 P0 4 . “Water-free” 
orthophosphoric acid is a self-ionizing amphoteric sys¬ 
tem, which yields H 4 P0 4 + and H 2 P0 4 _ . Protons can be 
passed along pyrophosphate chains at elevated temper¬ 
atures of around 200 °C, at which product water 
appears as vapor and can be rejected easily. 

Molten salts can also be utilized as electrolytes. 
Again, the ion responsible for conductivity should be 
created in one electrode reaction and consumed in the 
other. However, due to the (1) corrosive nature, (2) the 
limited thermal stability, and/or (3) the high melting 
point of many melts, only a few molten salts have been 
considered. Up to now, the only fuel cell technology 
based on a molten salt electrolyte has been using 
a eutectic mixture of alkali metal carbonates, Li 2 C0 3 
and K 2 C0 3 , at temperatures of around 650 °C, 
absorbed in a matrix of LiA10 2 . In this molten carbon¬ 
ate fuel cell (MCFC) the ion conducting the current is 
0 2 ~ in the form of C0 3 2- . At the anode, C0 2 is 
supplied and reacts with 0 2 and 4e _ to form C0 3 2- , 
which carries the current to the anode, where it is 
released in the hydrogen oxidation reaction. As 
a consequence, MCFC, in addition to the mass flow of 
fuel and oxidant, needs mass flow management of C0 2 . 

At even higher temperatures in the range of 
800-900 °C, ionic conduction can also be observed in 
solid oxides. Certain doped metal oxides become 
0 2 ~ conductors; hence, they can be utilized as electro¬ 
lytes in the solid oxide fuel cell (SOFC). The most 
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common oxide used is yttria-stabilized cubic zirconia 
(Zr0 2 ), called YSZ, with a doping level of 8% Y 2 0 3 . 
This material is a ceramic; the thermal expansion coef¬ 
ficient of its thin layers has to be matched with the 
other fuel cell components. 

As indicated by the terminology of fuel cells, e.g., 
PEFC, AFC, SOFC, others (see below), the electrolyte is 
the decisive cell component, which determines operation 
temperature, the choice the electrodes (electrocatalytic 
materials), and finally the specifics of the electrochemistry 
of the reactants. This, on the other hand, has conse¬ 
quences for the layout of the fuel cell design (balance of 
plant) and possible applications of the respective fuel cell 
technology, according to the required duty cycle of the 
application. 

Classification of Fuel Cell Types 

Today’s fuel cell types under development can be clas¬ 
sified in two different ways: either by the temperature 
range of operation or by the nature of the electrolyte. 

The latter classification distinguishes between 
acidic and alkaline fuel cells. Acidic fuel cells with H + 
conduction require other electrode materials than alka¬ 
line with OH - conduction. Alkaline fuel cells, at least 
with an aqueous electrolyte, require a C0 2 -free fuel, 
and hence cannot be operated on reformate fuel 
obtained originally from a C-containing fuel. 

The above-described situation displays the fact that 
there is a temperature gap in the availability of electro¬ 
lytes between ca. 200 °C and 650 °C. Today, major 
research efforts are carried out to extend the range of 
aqueous H + and OH - conduction to temperatures 
beyond 100 °C, for technological reasons (heat rejec¬ 
tion in cars, CO tolerance of platinum-based catalysts). 
On the other hand, oxide materials are developed with 
improved 0 2 ~ conductivity al temperatures lower than 
the operation temperatures of standard SOFCs. Also 
ceramic proton conductors [11] and proton¬ 
conducting salts like CsHS04 [12] are considered as 
electrolytes for this temperature regime. 

The Role of Electrocatalysis and the Electrode/ 
Electrolyte Interface 

Next to the voltage losses due to ionic conduction, 
voltage losses due to the activation overvoltages of the 
respective electrode reactions arise, as described by the 


Butler-Volmer, respectively, the Tafel equation. As 
stated above, at a certain value of the current density, 
the overvoltage for the HOR is much smaller than for 
the ORR, due to the simpler electron transfer kinetics. 
Overvoltage for an electrochemical reaction shows also 
a strong temperature dependency, as does activation 
energy, and decreases with increasing temperature. 

For fuel cells with an acidic electrolyte, platinum is 
the electrocatalyst of choice at temperatures up to 
200 °C (e.g., PAFC), due to the stability requirements. 
High dispersion of the catalyst is required, taking into 
account its nature as a precious metal, to provide a high 
surface area to volume ratio. Platinum nanoparticles in 
the diameter range of a few nanometers, typically 
2-5 nm, supported on carbon particles are utilized. 
Different carbons with different surface areas (up to 
800 m 2 /g) are used. The amount of platinum in the 
catalyst powder is specified as wt% per g of carbon 
and further as mg loading per cm 2 of geometric 
electrode area. 

Platinum is prone to CO adsorption at tempera¬ 
tures of around 100 °C and below. For this reason, care 
has to be taken with respect to the purity of hydrogen, 
in particular for hydrogen liberated from 
a C-containing fuel by a reforming process (steam 
reforming, partial oxidation, autothermal reforming). 

Alkaline electrolytes allow cheaper, non-precious 
metal catalysts like nickel and its alloys. 

High temperature fuel cells, e.g., MCFC or SOFC, 
require other catalyst materials, which are described in 
the respective section below. 

There is a common problem to all material selec¬ 
tions for the different fuel cell types, namely, to gener¬ 
ate an optimal electrolyte/electrode interface. One has 
to realize that gaseous molecular reactants are 
converted into ionic species, solvated/hydrated in 
their electrolyte medium, by exchanging electrons 
with the electrode material. This reaction occurs at 
the so-called triple phase boundary (triple point), 
where electron conducting, ion conducting, and gas 
phase (eventually dissolved in the electrolyte) join to 
each other. To allow a high surface area for current 
generation, the electrolyte/electrode interface is 
extended into a three-dimensional interphase with 
a certain thickness. For a polymer electrolyte fuel cell, 
this interphase is drafted in Fig. 7 for the cathode 
(oxygen) side. 
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(shown for cathode) 


Fuel Cell Types and Their Electrochemistry. Figure 7 

Electrode layer (interphase) with three phase boundary 
(schematic) of a polymer electrolyte fuel cell (cathode side). 
Blue : polymer electrolyte, black : carbon particles, gray : 
platinum nanoparticles (Adapted from L. Gubler, 

Paul Scherrer Institut) 


Recently, also anion exchange membranes, alkaline solid 
polymer electrolytes, have been considered [13]. 

The molten carbonate fuel cell, MCFC, and the 
solid oxide fuel cell, SOFC, are high temperature fuel 
cells, due to their temperatures of operation of around 
650 °C, respectively, 800-900 °C. Both can be consid¬ 
ered as alkaline fuel cells. 

The local appearance of the reaction product is 
determined by the current-carrying ion. While for 
acidic fuel cells the reaction product shows up at the 
cathode (oxidant side), it is the anode (fuel side) for 
alkaline fuel cells. In case of C-containing fuels, e.g., 
methanol, a second reaction product containing the 
oxidized carbon appears at the fuel side. The appear¬ 
ance and hence the removal of the product/s causes 
consequences for the mass flow engineering of the stack 
and fuel cell system. 

The following ordering of fuel cell types does not 
reflect the historical development of the various fuel 
cell technologies. 



Depending on the respective materials for the dif¬ 
ferent fuel cell types, this three-dimensional inter phase 
has to be established by very different preparation/ 
manufacturing procedures. In particular, solid electro¬ 
lyte materials must be processable to be included into 
the electrode layer. 

Fuel Cell Families 

The various fuel cell types considered today having 
some technical relevance are displayed schematically 
in Fig. 8. They are arranged according to the nature of 
electrolyte, acidic or alkaline, and to their temperature 
of operation. 

The polymer electrolyte fuel cell, PEFC, and the 
phosphoric acid fuel cell, PAFC, are acidic fuel cells; 
the PEFC operates in the temperature range below and 
around 100 °C (PEFC), respectively, and the PAFC at 
around 200 °C. Hydrogen is the preferred fuel for both 
types. A PEFC can also be fed by liquid or gaseous 
methanol, called direct methanol fuel cell, DMFC. 
Other fuels based on alcohols, e.g., the direct ethanol 
fuel cell, DEFC, are subject to research. Fuel cell types 
utilizing an acidic aqueous electrolyte will not be con¬ 
sidered here. 

Most concepts for an alkaline fuel cell in the past 
have been described with an aqueous alkaline electrolyte. 


Polymer Electrolyte Fuel Cell Types 

The idea of using a thin ion-conducting polymer mem¬ 
brane, solid polymer electrolyte , as electrolyte and sepa¬ 
rator can lead to different concepts of cells. Firstly, 
when ionic charges, anions or cations, are chemically 
bound to the polymer (ionomer) network, the respec¬ 
tive countercharge can move freely within the polymer 
volume, provided a certain volumetric charge density 
within the polymer exists, which on uptake of, e.g., 
water leads to phase separation into a hydrophobic 
polymer (backbone) and a hydrophilic, charge- 
containing phase [14]. As a consequence, the polymer 
morphology allows the continuous transport of this 
ion from one electrode interphase to the other. This 
concept is normally called polymer electrolyte fuel cell 
(PEFC). 

Depending on the fuel, H 2 , MeOH (gas or liquid), 
or others, the solid polymer electrolyte properties have 
to be tailored, in particular for their properties toward 
separation of fuel and oxidant. 

Further, non-ionomeric polymers can absorb an 
electrolyte, e.g., phosphoric acid H 3 P0 4 , which then 
renders ion conducting. This concept is realized in 
the so-called HT-PEFC, high temperature PEFC 
(ca. 100-200 °C), where a polybenzimidazole film 
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The important fuel cell families 



Fuel Cell Types and Their Electrochemistry. Figure 8 

The important fuel cell families of today 


imbibed with phosphoric acid becomes proton 
conducting due to the mechanism described above 
for non-aqueous phosphoric acid (see this book, 
Chapter Benicewicz). 

H 2 -Fed PEFC This type of H 2 -fed fuel cell, due to its 
high achievable specific and volumetric power density 
(in the range of 1 kW/kg and 1 kW/L and above), its 
cold start behavior, and its fast load following proper¬ 
ties, has found interest in the automotive industry and 
is under development by all major automotive 
companies. 

A thin ion-conducting polymer sheet, typically in 
the range of 25 pm, is utilized as a solid electrolyte, i.e., 
electrolyte and separator, in the PEFC (Fig. 1). For 
thermal stability reasons, only cation exchange mem¬ 
branes in the H + form have been considered up to 
today for technical applications (see also “Alkaline 
Fuel Cell”). In comparison to cells with liquid electro¬ 
lyte, this PEFC concept offers the advantage that the 


“electrolyte” is chemically bound within the polymer 
matrix and only water as reactant, in addition to the 
gases, appears in the peripheral system components. 
Membranes are perfluorinated or partially fluorinated 
polymers, with side chains ending in pendant acid 
groups, e.g., the sulfonic acid group -S0 3 H (Fig. 9). 

Under operation, the membrane has to contain 
some water, e.g., 15 water molecules per sulfonic acid 
group, to provide the necessary specific conductivity. 
This fact causes consequences for some of the system 
auxiliaries, as the gases, hydrogen and air, have to be 
humidified before flowing into the cell to sustain the 
hydration level in the membrane. 

The acidic electrolyte requires a precious metal 
catalyst; hence platinum supported on carbon particles 
serves as electrocatalyst, with typical Pt loadings of ca. 
0.1 mg/cm 2 at the anode side and ca. 0.4 mg/cm 2 at the 
cathode side. Platinum nanoparticles, typically a few 
(3-5) nanometers in diameter, are deposited on various 
carbon substrates (e.g., 20-40%Pt/C) by wet chemical 
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Fuel Cell Types and Their Electrochemistry. Figure 9 

Generalized structure formula of Nation® type membranes 
(DuPont) with m = 1,1 = 1, k = 5-7 

processes and then further processed in combination 
with solubilized ionomer material and binder(s) 
(PTFE) to yield an ink, which then is applied either to 
the electrolyte membrane surface (CCM, catalyst- 
coated membrane) or to the GDL to form the GDE. 

Extensive characterization of these electrocatalysts 
for the fuel cell reactions has shown that the 
electrocatalytic activity of these nanoparticles is by 
a factor 10 lower than the activity of a polycrystalline 
platinum surface for the respective reactions, HOR or 
ORR [15]. This difference is not fully understood yet. 

An interesting proprietary approach has been 
followed recently by the 3 M Company, creating a con¬ 
tinuous electrode area covered by nanoscale Pt whis¬ 
kers. Reactivity of these electrocatalytic layers is in the 
range of polycrystalline platinum surfaces [16]. 

Today, pure hydrogen is considered as the ideal fuel. 
Due to its CO content, hydrogen derived from 
C-containing fuels (by steam reforming (SR), partial 
oxidation (POX), or autothermal reforming (ATR) 
followed by preferential oxidation (PROX)) would 
need specific measures in terms of electrocatalysis, 
e.g., PtRu as bifunctional electrocatalyst, to allow oxi¬ 
dation of CO. As an alternative, purification methods 
of the anode gas stream prior to entering the cell down 
to only a few parts per million CO (depending on 
operation temperature) are necessary. 

The electrocatalytic layers are contacted by gas 
diffusion layers, allowing the gases to be passed to 
the interphase and the product(s), H 2 0 with H 2 
and 0 2 as reactants, which consist of carbon cloth 
or paper (Fig. 2 ). This arrangement of membrane, 
electrocatalytic layers, and gas diffusion layers is collo¬ 
quially called membrane electrode assembly (MEA). 

Most of the applications of this fuel cell type are 
developed for air operation, taking advantage of utiliz¬ 
ing one reactant from the ambient environment. How¬ 
ever, there exist also some applications in which pure 
oxygen is employed as oxidant. Due to the higher 


partial pressure of oxygen and the absence of electro- 
chemically inert nitrogen as the majority component in 
the cathodic gas stream, humidification issues of the 
cell can be strongly simplified [17, 18]. 

Methanol-Fed Fuel Cell (PEFC Type) The same con¬ 
cept of combination of an acidic solid polymer electro¬ 
lyte and acid stable precious metal electrocatalysts can 
also be applied to a methanol-fed fuel cell (direct meth¬ 
anol fuel cell, DMFC). Methanol can be fed in liquid or 
vapor form, mixed with water. The methanol molecule 
CH 3 OH is electro chemically converted at the anode: 

Anodic reaction : CH 3 OH + H 2 0 = 6H + 6e“ + C0 2 

(methanol oxidation reaction, MOR) 

Cathodic reaction : 0 2 + 4H + + 4e“ = 2H 2 0 

(oxygen reduction reaction, ORR) 

Overall reaction : CH 3 OH + H 2 0 + 0 2 = C0 2 + 3H 2 0 

As seen above, during MOR, one molecule of water 
is consumed, which is necessary to oxidize carbon in 
the CH 3 OH molecule to C0 2 , while potentially six 
protons are liberated. The ORR can be formulated the 
same way as in a hydrogen-fed cell. 

The equilibrium potential for the anode reaction is 
0.02 V; hence, a theoretical cell voltage close to the 
H 2 /0 2 fuel cell should be observed. In practice, the 
OCV is lower, for several reasons: Due to the similarity 
of methanol and water as solvents (e.g., solubility 
parameter), methanol also penetrates into the water- 
swollen polymer membrane and passes to the cathode 
(methanol crossover), causing a mixed potential at 
a lower value than the oxygen potential. Further, the 
anode reaction requires a binary or ternary bifunc- 
tional catalyst, containing next to platinum one metal 
component (or two) providing the splitting reaction of 
the water molecule involved to liberate oxygen for the 
carbon oxidation at lower potentials as compared to 
platinum. Examples would be Ru, Sn, Mo, others, or 
even combinations of two of these together with plat¬ 
inum as the hydrogen-liberating catalyst. Intermediate 
oxidation species of the methanol molecule, e.g., 
-COH, may adsorb and poison the platinum catalyst 
surface, thereby impeding the full oxidation reaction. 

These kinetic losses at the anode lead to a high 
anode overvoltage and, therefore, the cell voltage in 
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the DMFC is lower at a respective value of current 
density as compared to a H 2 -fed cell, as is the achiev¬ 
able power density. 

Direct methanol fuel cell concepts with liquid elec¬ 
trolyte have also been considered in the past. One 
advantage of circulating liquid electrolytes is the option 
to cool the cell without additional cooling fluid. 

High Temperature PEFC As mentioned above, 
H3PO4 can be absorbed in polybenzimidazole films to 
yield water-free proton-conducting membranes. The 
concept is strongly related to the phosphoric acid fuel 
cell, PAFC. These cells can be operated above 100 °C, 
typically in the temperature range of 160-180 °C, with¬ 
out humidification, allowing to simplify the balance of 
plant. Further, due to the higher operation temperature 
as compared to H 2 -fed cells, CO tolerance is drastically 
increased. Disadvantages are the loss of H3PO4 during 
start-stop cycles, i.e., excursions to temperatures below 
100 °C, where liquid product water causes leaching of 
the ionic component. In addition, achievable power 
densities are typically lower than with H 2 -fed cells, 
due to the fact that phosphate cations strongly adsorb 
on platinum even at these temperatures, impeding the 
ORR. For more detailed information please see 
chapter “Benicewicz”. 

Phosphoric Acid Fuel Cell 

Looking back at the development of fuel cell technolo¬ 
gies, the PAFC technology has been the earliest one to 
be commercialized. Several 200 kW e power units for co¬ 
generation have been delivered to customers over the 
past 20 years, mainly by the US company UTC Power 
[19]. Water-free phosphoric acid is absorbed in an 
inorganic diaphragm and contacted on both sides by 
platinum-containing gas diffusion electrodes. Advan¬ 
tages and disadvantages are very similar to the ones of 
the HT-PEFC. 

Alkaline Fuel Cell 

Alkaline electrolytes played an important role in the 
beginning of fuel cell development, due to its less strin¬ 
gent requirements for electro catalysts. An aqueous 
solution of NaOH or KOH at higher concentration is 
absorbed in a matrix, formerly asbestos, today 
a composite material of an oxide and a polymer binder. 


The electrochemistry of the respective HOR and ORR 
can be written the following way: 

Anodic reaction : H 2 + 20H - = 2H 2 0 + 2e“ 

(hydrogen oxidation reaction, HOR) 

Cathodic reaction : l/20 2 + 2H 2 0 + 2e _ = 20H - 

(oxygen reduction reaction, ORR) 

Overall reaction : H 2 + l /i0 2 — H 2 0 

As described above, an alkaline cell generally has to 
be operated on H 2 and 0 2 , due to the C0 2 problem. As 
advantage, a non-precious metal, such as Raney nickel, 
can serve as electrode material. ORR in alkaline solu¬ 
tion shows a lower overpotential loss than in acidic 
solution; hence, higher specific power densities 
(mW/cm 2 ) can be generated in cells. 

Recently, some progress in the preparation of anion 
exchange membranes was reported, leading to the appli¬ 
cation of the PEFC concept to alkaline fuel cells. How¬ 
ever, performance in terms of power output and 
stability/lifetime is far below the acidic technology [13]. 

Molten Carbonate Fuel Cell 

As a high temperature fuel cell, which operates in the 
temperature range around 650 °C, the MCFC can use 
the waste heat of the fuel cell reactions for internal 
reforming of C-containing fuels (Fig. 10). No further 
processing of the reformed fuel is necessary. Reformed 
hydrogen is oxidized at the anode, consisting of 
a porous nickel electrode, under consumption of the 
C0 3 2- cation, which carries the current through the 
electrolyte, to H 2 0 and C0 2 . The anode reaction prod¬ 
uct C0 2 has to be recycled to the cathode compart¬ 
ment, where it is consumed during the ORR at a porous 
Ni-Oxide cathode to C0 3 2- and taken up by the elec¬ 
trolyte melt again, therefore balancing the C0 3 2- con¬ 
centration in the Li 2 C0 3 /K 2 C0 3 eutectic electrolyte, 
which is absorbed in a LiA10 2 matrix. 

MCFC systems in the power range of 250 kW have 
been delivered for on-site testing, using a variety of 
fuels, like biogas, reformed natural gas, and others [20] . 

Solid Oxide Fuel Cell 

The electrolyte in a solid oxide fuel cell is a thin ceramic 
body, which is an 0 2 -conducting oxide material. 
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Fuel Cell Types and Their Electrochemistry. Figure 10 

Scheme of the molten carbonate fuel cell, MCFC (Adapted from MTU CFC Solutions) 


0 2 ~ conduction is established by doping a stoichio¬ 
metric material like Zr0 2 by Y 2 0 3 , thereby generating 
0 2 ~ vacancies in the lattice. Around 8% of Y 2 0 3 are 
required to stabilize the cubic form of Zr0 2 at temper¬ 
ature of operation. A reasonable specific conductivity 
for this 8YSZ material is obtained in the range of and 
above 800 °C (Fig. 11). 

Electrode materials consist of a Ni/8YSZ cermet 
(mixture of ceramic and metal) as anode and 
La x Sr y Mn0 3 (perovskite) as cathode. 

Ohmic losses in the electrolyte can be lowered by 
designing cells with thin electrolyte layers, typically 
10 pm thick, supported on one of the electrode struc¬ 
tures, instead of utilizing a thicker, free-standing elec¬ 
trolyte layer. 

For many years, also a tubular concept had been 
developed by Siemens Westinghouse, addressing the 
challenges in matching the thermal expansion coeffi¬ 
cients of the different materials, namely for the electro¬ 
lyte and the electrodes. However, the further 
development of this concept had been abandoned. 
SOFC technology development mainly addresses sta¬ 
tionary co-generation of power and heat, at a power 
scale of 100 kWe and more (e.g., Topsoe Fuel Cell, 


On + 4e~ —> 2 O 2- 


i 


Electrolyte yp 2 


Cathode 


Anode 

2H 2 -> 4 H + + 4e“ 4 H + + 2 0 2 “ -> 2 H 2 0 


Fuel Cell Types and Their Electrochemistry. Figure 11 

Scheme of the planar solid oxide fuel cell, SOFC, and its 
electrochemistry (Adapted from L. Gauckler, ETH Zurich) 


Denmark), as well as small scale (1 kW e ) co-generation 
for private homes (e.g., Hexis Switzerland). 

Interestingly, low power application micro-SOFCs 
are developed as battery alternatives, utilizing materials 
which allow operation temperatures in the range of 
550 °C (One-Bat-Project, ETZ Zurich [21]). 

Future Directions 

After two decades of intensive fuel cell research and 
development, some of the described technologies have 

















3886 


F 


Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels) 


come close to the market. This is particularly the case 
for the PEFC technology developed worldwide by auto¬ 
motive industry. Market introduction is planned, 
according to various sources, by the year 2015. How¬ 
ever, still issues of cost and reliability remain and 
provide ample space for further research. Unfortu¬ 
nately, other technologies, e.g., the SOFC and MCFC 
technology, have recently seen some setbacks, due to 
the fact that some longstanding prominent advocates 
of fuel cell development have decided not to continue 
their effort (e.g., SOFC by Siemens, MCFC by Tognum 
(formerly MCFC solutions), others). 
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Glossary 

Anode Electrode for electrochemical oxidation reac¬ 
tions. In solid oxide fuel cells, hydrogen-containing 
fuels are oxidized by oxygen ions transported 
through an electrolyte to form water vapor or C0 2 
as the reaction products at this electrode. SOFC 
anodes may also act as fuel reforming catalysts 
when hydrocarbon-based fuels are supplied to the 
anodes. 

Cathode Electrode for electrochemical reduction reac¬ 
tions. In solid oxide fuel cells, oxygen in ambient air 
is reduced to oxygen ions at this electrode. 
Electrolyte Ionic conductor with negligible electronic 
conductivity, used as a membrane between an oxi¬ 
dation atmosphere and a reduction atmosphere. 
The essential material to construct electrochemical 
devices including fuel cells. 

Fuel flexibility While low-temperature fuel cells (such 
as polymer electrolyte membrane fuel cells) typi¬ 
cally use pure hydrogen gas (or hydrogen gas mixed 
with C0 2 ) as the fuel, high-temperature fuel cells 
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can directly use various kinds of fuels, such as 
carbon monoxide (CO), which is created by 
reforming hydrocarbons at the anode to produce 
hydrogen and carbon monoxide. 

Impurity poisoning Even though high-temperature 
fuel cells have good fuel flexibility with various 
kinds of fuels, some minor constituents (impuri¬ 
ties) coming from, for example, low-purity fuels, 
raw materials, and system components can also 
react with electrode materials or can be adsorbed 
on the electrode reaction sites, hindering electrode 
reactions. Such poisoning phenomena can lead to 
fuel cell performance degradation with time. 

Mixed ionic electronic conductor Materials with 
both high ionic conductivity and high electronic 
conductivity. Such materials are often attractive 
for electrodes because of extension of electrochem¬ 
ical reaction sites beyond electrode/electrolyte 
interface. 

Oxygen ionic conductor Materials with high oxygen 
ionic conductivity. Oxides with both high oxygen 
vacancy (or other charge carrier such as interstitial 
oxygen atoms) concentration and high oxygen 
mobility exhibit high ionic conductivities. 

Protonic conductor Materials with high protonic 
conductivity. In the presence of oxygen vacancies, 
water vapor may be dissolved into oxides forming 
protons and exhibiting high protonic conductivity. 

Definition of the Subject 

While we may realize commercialization of SOFC 
systems in the very near future, continuous challenges 
will be needed to develop advanced fuel cells with 
higher efficiency, higher durability, better flexibility, 
as well as lower production cost. For realizing next- 
generation SOFCs, we may apply a broad range of 
knowledge related to (1) electrolyte and (2) electrode 
materials for cell development as well as (3) explore 
alternative fuels. This entry gives an overview on pos¬ 
sible alternative approaches for these three important 
aspects to realize advanced high-performance SOFCs 
in a future generation. 

Introduction 

Fuel cells, environmentally compatible energy systems 
using hydrogen-containing fuels [1-5], are strongly 


desired as promising key technologies to realize 
a sustainable low-carbon society. Compared to internal 
combustion, electrochemical energy conversion by fuel 
cells can achieve much higher efficiency. While the 
direct electrochemical conversion of chemical energy 
to electricity using fuel cells has received significant 
attention for several decades, mass marketing of fuel 
cell systems is often limited by insufficient performance 
and/or inadequate durability in service. 

While the commercialization of fuel cell technolo¬ 
gies has just begun in the form of, for example, station¬ 
ary cogeneration power units mainly based on polymer 
electrolyte fuel cells for private houses, a broad variety 
of other application is strongly desired. Even though 
solid oxide fuel cells (SOFCs) are the most efficient type 
of fuel cells, partly due to their higher operational 
temperature compatible with internal reforming of 
hydrocarbon-based fuels, their application has often 
been restricted by the limited operational range of 
electrolyte and electrode materials [1-3]. Although 
many current research projects tend to focus on engi¬ 
neering and optimization issues related to cost reduc¬ 
tion, durability, and reliability [6-8], in order to take 
SOFCs closer to full-scale commercialization, break¬ 
throughs in materials and concepts are needed for 
developing viable fuel cells for broader application. It 
is therefore essential to consider alternative materials 
for electrolytes and electrodes and innovative fuel cell 
concepts including the use of alternative fuels for 
future-generation fuel cells. For example, alternative 
electrolyte/electrode materials for lower-temperature 
SOFCs are needed to improve electrical efficiency in 
low-power operation, tolerance against thermal 
cycling, as well as improve longer-term durability. 
They will help realize alternative fuel cells in real com¬ 
mercial systems. Additionally, morphological control 
on the nano-scale shows promise in enhancing ionic 
and electronic transport in nano-ionic systems [9-14], 
offering an additional degree of freedom in material/ 
device design. A reversing challenge is to harness these 
effects to the fabrication of practical SOFCs with 
improved performance. 

Alternative device concepts can help realize broader 
application of SOFCs. SOFCs for automotive applica¬ 
tions will enable multi-fuel vehicles capable of being 
powered by both hydrogen and hydrocarbon fuels. The 
use of biogas can result in carbon-neutral energy 
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conversion. The use of hydrocarbon fuels and gasified 
coal gas fuel for SOFCs will achieve more efficient 
use of such fossil fuels. Portable SOFCs operating 
with hydrogen, hydrocarbon fuels [15, 16], and solid 
carbon [17] are attractive as replacements for batteries. 
Reversible SOFCs can act as both power units and 
electrolyzers, efficiently converting excess heat and 
electricity into hydrogen and vice versa [18-20]. If 
these technologies with SOFCs are realized, a consid¬ 
erable reduction in C0 2 emissions is possible in house¬ 
hold, industrial, and transportation sectors. 

It is therefore interesting and essential to consider 
such material/device issues in SOFC R8d) to realize 
future flexible SOFCs with broader application. In 
this entry, an overview will be given on alternative 
approaches to SOFCs related to the following three 
critical material-related issues: electrolytes, electrodes, 
and fuels. 

Alternative Electrolyte Materials 

Electrolyte Materials: General 

As the ionic conductivity of solid electrolytes is gener¬ 
ally much lower than the electronic conductivity of 
electrode materials, the ionic conductivity of the solid 
electrolytes may determine the lowest possible opera¬ 
tional temperature of SOFCs. Apart from high ionic 
conductivity, the following properties are required: 

1. High ionic transference number around unity 

2 . Thermochemical stability in both oxidation and 
reduction atmospheres 

3. Near theoretical density for minimum gas leakage 

4. High compatibility with other materials in compos¬ 
ite device 

5. Match of thermal expansion coefficient with other 
components with minimal defect-induced chemi¬ 
cal expansion 

In addition, high mechanical strength and reason¬ 
able materials cost are required in the case of electro¬ 
lyte-supported cells. 

There are limited numbers of materials satisfying 
the above-mentioned criteria. Generally, Y 2 0 3 - 
stabilized Zr0 2 (YSZ) is commonly used at high tem¬ 
peratures, that is, T > 800° C to maintain ionic con¬ 
ductance, while thin-film Zr0 2 -based electrolytes may 


be applied for operational temperatures around 700° C. 
For much lower operational temperatures, alternative 
electrolyte materials are needed. Ionic conductivity 
depends strongly on the crystal structure, and oxides 
with high ionic conductivity have been found among 
open crystal structures, such as fluorite, pyrochlore, 
rare-earth, perovskite, and related structures [21, 22]. 

Oxygen Ionic Conductors 
Fluorite-Type Materials 

General The fluorite structure is relatively open, and 
it shows exceptional tolerance for high levels of atomic 
disorder, which may be introduced either by doping, 
reduction, or oxidation [23] . Among the binary oxides, 
Ce0 2 , Pr 6 O n , Th0 2 , U0 2 , and Pu0 2 possess this 
structure in the pure state in ambient conditions. 
Zr0 2 and Hf0 2 are stabilized in the fluorite structure 
by doping with divalent or trivalent oxides (Y, Sc, rare- 
earth oxides) [23-25]. Beyond stabilization, the 
addition of such dopants gives rise to the creation of 
oxygen vacancies responsible for the ionic conduction 
in these oxides. 

Several fluorite-type oxides are known to be excel¬ 
lent oxygen ionic conductors, including 5-Bi 2 0 3 , Ce0 2 , 
Zr0 2 , Th0 2 , and Hf0 2 . The 5-Bi 2 0 3 phase possesses an 
anion-deficient fluorite structure where one-fourth of 
the normal fluorite anion sites are vacant. This phase 
exhibits an order to disorder transition above ~600°C 
(which can be suppressed to some extent through dop¬ 
ing) with a consequent increase in conductivity and has 
the highest oxygen ionic conductivity so far reported 
for a solid electrolyte [21, 22]. However, 5-Bi 2 0 3 shows 
poor thermo chemical stability, and it is easily reduced 
at high temperatures. Th0 2 is a radioactive material. 
Hf0 2 has chemical similarity to Zr0 2 but is relatively 
expensive. Practically available fluorite-type oxides for 
solid electrolyte are thus limited to Zr0 2 and Ce0 2 . 
Most of the rare-earths, as well as Sc, In, and Y, are 
trivalent and form sesquioxides of various crystal struc¬ 
tures with potential for solid electrolyte membranes. 
Most of them have the cubic C-type structure (Sm 2 0 3 , 
Dy 2 0 3 , Er 2 0 3 , Sc 2 0 3 , Y 2 0 3 , and ln 2 0 3 ) which is closely 
related to the fluorite structure [23], which can be 
derived from the fluorite structure by removing one- 
fourth of the oxygen atoms which lie along the four 
< 111 > directions [23]. SrO-doped La 2 Q 3 exhibits 
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ionic conductivity close to the Y 2 0 3 -doped Zr0 2 elec¬ 
trolyte [21, 22]. However, most of these sesquioxides 
are mixed ionic and electronic conductors with 
a substantial electronic conductivity [23, 26-29]. 

Zr0 2 -Based Oxide Pure Zr0 2 exhibits phase changes 
from monoclinic to tetragonal and cubic, with 
increasing temperature. Due to its volume change of 
ca. 5% in the phase transition from monoclinic to 
tetragonal, pure Zr0 2 cannot be used as a practical 
solid electrolyte material. Group 2 elements of the 
periodic table or rare-earth elements may be added to 
stabilize the cubic phase at high temperatures and keep 
this phase stable also at lower temperatures. As for 
stabilized Zr0 2 with dopants from the group 2 
elements, CaO-stabilized Zr0 2 is widely used mainly 
for Nernst-type oxygen sensors [30], but it is not used 
for fuel cells because of its relatively low conductivity 
[31] . On the other hand, stabilized Zr0 2 using dopants 
from rare-earth elements such as Y 2 0 3 , Sc 2 0 3 , Yb 2 0 3 , 
ln 2 0 3 , and Er 2 0 3 [21, 22, 32, 33] exhibits higher 
conductivity suitable for SOFC [34-38]. Ionic 
conductivity (intragrain conductivity) of Zr0 2 doped 
with (a) Y 2 0 3 , (b) Yb 2 0 3 , (c) Er 2 0 3 , (d) ln 2 0 3 , and (e) 
Sc 2 0 3 measured by impedance spectroscopy is shown 
in Fig. 1 [33]. In addition, activation energy values for 
the intragrain ionic conductivity of these doped Zr0 2 , 
cr ion ( S/m) x T(K) = <7 0 (S/m) x exp(-A E a (eV)/fcT), 
are compiled in Table 1. 

Yttria-stabilized Zr0 2 , discovered by Nernst [39], is 
still one of the state-of-the-art SOFC electrolyte mate¬ 
rials which was used to demonstrate the first SOFC 
(and the first solid “electrolyte” fuel cell) in 1937 at 
ETH-Zurich [40]. Electronic defect concentrations are 
negligibly low [41]. As can be observed in Fig. 1, the 
ionic conductivity of fluorite-type oxides stabilized 
with hypovalent elements exhibits a maximum at 
a certain dopant concentration above which defect inter¬ 
actions occur [42-45]. As shown in Fig. 2, it is known 
that the peak value of the ionic conductivity becomes 
higher when the ionic radius of the dopant approaches 
that of Zr 4+ [35], resulting in minimal distortion of the 
fluorite lattice. Ionic conductivity is the highest in 
the case of Sc 3+ , which is used as a dopant because its 
ionic radius is almost identical to that of Zr 4+ [46]. 
However, Sc 2 0 3 is more expensive than Y 2 0 3 , but 
Sc 2 0 3 -stabilized Zr0 2 (ScSZ) has recently become 


more widely used. In addition, ScSZ with 8 mol% 
Sc 2 0 3 exhibited a high conductivity in the early stages 
of operation, but the conductivity decreased with oper¬ 
ational time due to phase instability [47]. To prevent 
this decrease in ionic conductivity, ScSZ with 11 mol% 
Sc 2 0 3 was preferably used, but this material had 
a problem of phase transition in the vicinity of 700° C 
[35, 48]. Currently, ScSZ with the addition of Ce0 2 , 
such as (Zr0 2 )o.89(Sc 2 0 3 )o.i(Ce0 2 ) 0 .oi> is widely used 
to suppress this phase transition [46, 49]. 

In addition, tetragonal Zr0 2 and partially stabilized 
Zr0 2 can also be used as an electrolyte due to their 
better mechanical properties and their lower activation 
energy for ionic conductivity. Since the grain size of the 
tetragonal phase is generally much smaller than that of 
the cubic phase and grain boundaries impede oxide ion 
migration, the reduction of grain boundary resistivity 
is important [50]. Stability of the tetragonal phase for 
longer time periods should be considered to prevent 
phase transition from the tetragonal phase to the 
monoclinic phase when applying this material to 
SOFC systems. 

Thin-film technology can also help to realize lower 
operational temperatures in SOFCs by taking advan¬ 
tage of short transport distance of ions. It is also pos¬ 
sible to produce thin-film electrolytes using stabilized 
Zr0 2 , which enables lower-temperature operation of 
SOFCs because of its stability in both the oxidation and 
reduction atmospheres, its sinterability, and its high 
mechanical strength. Ultrathin YSZ electrolyte layers 
can be prepared on Si or glass ceramic substrates 
[15, 16, 51], which can realize, for example, lower 
temperature operation of micro-SOFCs for portable 
applications. Figure 3 shows an SEM cross-sectional 
view of a micro-SOFC prepared on a glass ceramic 
substrate with a YSZ electrolyte thickness of less than 
1 pm, as reported by Bieberle-Hiitter et al. [51]. 

Stabilized Zr0 2 exhibits good chemical compatibil¬ 
ity with other materials during SOFC operation below 
1,000°C. It is possible that stabilized Zr0 2 solid elec¬ 
trolyte reacts with other materials during the cell fab¬ 
rication processes which often require high 
temperatures in excess of 1,300°C. YSZ does react 
with Fa!_ x Sr x Mn0 3 _§ (FSM), the state-of-the-art 
cathode material, to produce Fa 2 Zr 2 0 7 or SrZr0 3 at 
temperatures above 1,100°C, with a consequential 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 1 

Intragrain conductivity of Zr0 2 doped with (a) Y 2 0 3/ (b) Yb 2 0 3 , (c) Er 2 0 3/ (d) ln 2 0 3/ and (e) Sc 2 0 3 measured by impedance 
spectroscopy, where t and c denote tetragonal and cubic phases [33] 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Table 1 Activation energies and 
pre-exponential factors of ionic conductivity of Zr0 2 doped 
with trivalent metal oxides (Y 2 0 3/ Yb 2 0 3/ Er 2 0 3/ ln 2 0 3 . 


Sc 2 0 3 ) [33], based on the relation o- ion (S/nn) x T(K) = cr Q (S/ 
m) x exp(—A E a (eV)//cT), where t and c denote tetragonal 
and cubic phases 


Dopant 

mol%(phase) 

Intragrain conductivity 

AEa(eV) crO 

yo 15 

5.8 (t) fc 

0.91 

5.11 x 10 7 


11.2 (t') 

1.04 

5.49 x 10 s 


12.0 (t') 

1.06 

6.93 x 10® 


12.7 (t') 

1.10 

1.34 x 10 9 


13.5 (c) 

1.09 

1.20 x 10 9 


14.3 (c) 

1.12 

1.69 x 10 9 

Ybi.s 

14(t') 

1.07 

1.43 x 10 9 


16(c) 

1.13 

3.66x10 9 


20 (c) 

1.19 

3.74x10 9 

ErO,.5 

14 (t') 

1.02 

3.86 x 10® 


16(c) 

1.20 

7.51 x 10 9 


20 (C) 

1.25 

7.74 x 10 9 

In0 15 

15 (t) 

0.80 

6.39 x 10 6 


15 (t') 

1.10 

2.99 x 10® 


17.5 (t') 

1.28 

4.29 x 10® 


25 (c) 

1.35 

7.51 x 10 9 


25 (C) fc 

1.22 

5.26 x 10 9 


35 (c) 

1.56 

3.47 x 10 10 


35 (c) fc 

1.14 

1.57 x 10® 


45 (C) 

1.48 

8.22 x 10 9 


45 (c) fc 

1.26 

1.83 x 10 9 

Sc0 15 

1 2 (t') 

1.20 

6.94 x 10 9 


14 (t') 

1.24 

1.20 x 10 10 


16(c) 

1.40 

2.44 x 10" 


fc furnace cooling at a rate of 10°C/min. Others : air quenching 



y3+ 

Dopant ion radius (A) 


0.30 

0.25 

0.20 

0.15 

0.10 


c n 


o 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 2 

Relation between dopant ionic radius, conductivity, and 
content of dopant, reported by Arachi et al. [35] 
(Reproduced by permission of Elsevier) 




increase in internal cell resistance [52-54]. In the 
case of LaCo0 3 - or LaFe0 3 -based oxide cathode mate¬ 
rials, reaction with stabilized Zr0 2 becomes worse, so 
that Ce0 2 -based oxides such as Gd 2 0 3 -doped Ce0 2 are 
generally used as a buffer layer between the Zr0 2 -based 


Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 3 

SEM cross-sectional view of a micro-SOFC prepared on 
a glass ceramic substrate, as reported by Bieberle-Hutter 
et al. [51] (Reproduced by permission of Elsevier) 
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electrolyte and such cathode materials to prevent inter¬ 
facial reactions from forming [3, 55]. 

Ce0 2 -Based Oxide Ce0 2 -based oxide, partially 
substituted by a hypovalent rare-earth oxide, exhibits 
high ionic conductivity in a wider range of 
compositions in comparison to stabilized Zr0 2 . This 
is because even pure Ce0 2 has the fluorite-type cubic 
structure rendering stabilization unnecessary. Like 
Zr0 2 , ionic conductivity of Ce0 2 also varies with the 
substitution of elements. It shows high ionic 
conductivity when it is partially substituted with 
alkaline-earth elements such as Ca and Sr [56, 57], 
and when partially substituted with a rare-earth 
element, even higher conductivity. As shown in Fig. 4, 
there is a systematic trend of the conductivity with 
ionic radius of the substituting element giving the 
highest conductivity with Sm [58]. Pure Sm 2 0 3 has 
a rare-earth C-type structure, and the similarity of 
structures between Sm 2 0 3 and Ce0 2 may lead to high 
stability. High ionic conductivity can be obtained in 
a wide range of substitution ratios with Sm 2 0 3 . 



Radius of dopant cation/nm 

Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 4 

Relationship between the dopant ion radius and the ionic 
conductivity of doped Ce0 2 at 800°C, as reported by 
Eguchi et al. [58] (Reproduced with permission from 
Elsevier) 


Sm 2 0 3 -doped Ce0 2 , usually abbreviated as SDC, with 
20 mol% partial substitution of Sm 2 0 3 , exhibits the 
highest ionic conductivity. In addition, Ce0 2 partially 
substituted with Gd 2 0 3 is also widely used as an 
electrolyte and, as previously mentioned, interlayer 
between the stabilized Zr0 2 electrolyte and the 
cathode, although it shows slightly lower conductivity 
than SDC [59]. More recently, double doping with Sm 
and Nd exhibits one of the highest ionic conductivities 
reported for ceria [60]. 

When doped Ce0 2 is applied as a solid electrolyte 
for SOFCs, a change in valence state of Ce 4+ to Ce 3+ in 
a reducing atmosphere at the anode results in two main 
drawbacks. First, it causes n-type electronic conduction 
in the solid electrolyte which leads to a cell voltage 
drop. Precisely speaking, when Ce0 2 is reduced, both 
oxygen vacancy and electron are introduced simulta¬ 
neously. Since the mobility of electrons is much higher 
than that of oxygen ions, the transference number, that 
is, the fraction of ionic conductivity from the total, 
decreases from unity. Second, when lattice oxygen is 
removed by reduction, the lattice expands, known as 
chemical expansion. A steep oxygen potential gradient 
appears in the Ce0 2 electrolyte from reducing anode to 
oxidizing cathode under the operating conditions of 
SOFCs, and a large mechanical stress in the electrolyte 
can occur resulting in fracture of the cell [61]. 

These issues can be overcome partially by using 
stabilized Zr0 2 as an interlayer between the anode 
and the electrolyte [62-65]. However, since there is 
a thermal expansion mismatch between the Zr0 2 - 
based solid electrolyte and the Ce0 2 -based interlayer, 
this often causes microscopic fracture [64, 65]. GDC 
with an addition of Pr (Ce 0 . 8 Gd 0 .i 8 P r 0 . 02 O 2 - 5 ) showed 
enhanced reduction resistance but with the same level 
of ionic conductivity as SDC [66]. In addition, there 
have been some attempts to realize lower-temperature 
operation by fabricating ultrathin membranes of Ce0 2 
electrolyte materials which contribute to reduce electri¬ 
cal resistance. Electronic conductivity, caused by reduc¬ 
tion, decreases with decreasing temperature, but ionic 
conductivity also decreases simultaneously. Although 
making thinner solid electrolytes is accompanied with 
certain difficulties, Ce0 2 -based electrolytes can be 
operated at lower temperatures below 600° C, where 
the influence of electronic conduction caused by 
the reduction may not be a fatal problem [67]. 
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For example, Oishi et al. [67, 68 ] have reported stable 
operation of metal-supported Ce0 2 -based SOFCs, 
composed of a porous ferritic stainless steel support, 
a Ni-Ce 0 . 8 Y 0 - 2 O L9 anode, a thin Ceo. 9 Gdo.iOi .95 elec¬ 
trolyte, and a La 0 . 6 Sro. 4 Coo. 2 Feo. 80 3 cathode, over 
1,200 h at 600°C. Ce0 2 -based electrolyte with a thick¬ 
ness of about 10 pm allows operation of SOFCs at 
rather low temperatures and selection of ferritic steel 
as a support material, resulting in enhancement of 
mechanical strength of cells, improvement of redox 
stability of cell stacks, and minimization of the use of 
expensive electrode and electrolyte materials [67, 68 ]. 
Considering transport in thin-film Ce0 2 -based elec¬ 
trolytes, nano-size effects [ 12 ] and mixed conduction 
[62] should be taken into account to tailor transport 
phenomena in SOFCs. 

Pyrochlores and Other Fluorite-Type Oxides (Y, Nb , Zr) 
0 2 s Pyrochlore-type oxides have the general 
formula A 2 3 + B 2 4 + 0 7 in which A is a rare-earth 
element such as Gd or Y, and B is Ti or Zr. Gd 2 Zr 2 0 7 
is the typical composition of pyrochlore-type oxides 
and can be considered as fluorite-type in which ionic 
defects are regularly arranged. The defect structure and 
the mixed conductivity can be controlled by the value 
of x in, for example, Gd 2 (Zr x Ti!_ x ) 2 0 7 , which is 
abbreviated as GZT [69]. When the ionic radius of 
rare-earth elements for the A site is larger than that of 
Gd, the structure changes from highly defective fluorite 
to pyrochlore [70, 71]. 

In the Gd 2 Ti 2 0 7 series, relatively high ionic con¬ 
ductivity is obtained at the 20 mol% substitution of Gd 
sites with Ca [72], though its use as an SOFC solid 
electrolyte is precluded by its electronic conductivity in 
air, with an ionic transference number lower than that 
of YSZ [73]. The thermal expansion coefficient of 
Gd 2 Ti 2 O y (10.4-10.8 x 10 6 K _1 ) [73, 74] is close to 
that of LSGM (10.4 x 10 6 K -1 ) [75] and lOScSZ (10.9 
x 10 6 K _1 ) [76]. 

In addition, the solid solution of Y 4 Nb0 8 . 5 and 
(Y, Nb, Zr)0 2 _§ also has the fluorite-related structure, 
and these materials have stimulated research interest 
due to their oxide ion conduction [77, 78]. Electrical 
conductivity of Y 4 Nb0 8 . 5 is independent of the oxygen 
partial pressure, and it is recognized as an ionic 
conductor [78]. The ionic conductivity is low, 
enhancement of conductivity by the addition of Zr 


was reported, and further improvement of ionic 
conductivity may be expected [77]. 

Perovskite-Related Oxides 

General Perovskite-type oxides have the general 
formula AB0 3 . The total valence of the A metal and 
the B metal ions is 6 , and various combinations are 
possible. However, there is a limit to the ionic radius (r) 
of the constituent metal ions (A and B) and the oxygen 
ion (O), as the tolerance factor t (t = ^ A R +I ^ ( W ) 

should satisfy 0.75 < t < 1 [79, 80]. Oxides related to 
LaGa0 3 , LaA10 3 , and Ba 2 In 2 0 5 have been well 
examined as solid electrolyte materials for SOFCs. 
Takahashi and Iwahara [81] already showed in 1970s 
that selected perovskite solid solutions exhibited 
high ionic conductivity around 5 x 10 -3 S/m at 800° C 
[ 21 , 22 ]. 

LaGa0 3 -Based Oxides Oxygen ionic conductivity 
in perovskite-type-oxide structures has been 
extensively investigated by Takahashi and Iwahara in 
their pioneering and detailed studies [81]. High ionic 
conductivity was observed in the Ti series of perovskite- 
type oxides. Despite detailed studies, perovskite-type 
titanate showed poor ionic conductivity compared to 
that of YSZ. Furthermore, hole conduction was 
relatively high, so that the ionic transference number 
was less than unity in oxidizing atmospheres. They also 
investigated perovskite-type oxides in which trivalent 
cations were added to both the A site and the B site. 
LaCo0 3 and LaFe0 3 are known as such kinds of 
oxides, but these oxides are mixed conductors with 
predominately high hole conductivity. However, 
LaA10 3 , with trivalent metal ions on the B site, 
showed relatively high ionic conductivity [81]. 

Building on these results, Ishihara et al. [82] 
attempted to increase lattice cubic volume by applying 
Ga to the B site which has a slightly larger ionic radius 
than Al in the La-based perovskite oxides (Fig. 5). They 
added various dopants, and the highest oxygen ionic 
conductivity was obtained in the case where 20 mol% 
of Sr was added to the A site, which has an approxi¬ 
mately equal ionic radius to La [82]. As for the 
dopant on the B site, the highest ionic conductivity 
was obtained when 20 mol% of Mg was added to 
the B site (La 0 . 8 Sr 0 . 2 Ga 0 . 8 Mgo. 2 0 2 . 6 [83]), since Mg 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 5 

Comparison of the oxygen ionic conductivity of 
Lao. 9 Sro. 1 Gao. 8 Mgo. 2 O 3 with conventional oxide ionic 
conductors with fluorite structures, compiled by Ishihara 
et al. [82] (Reproduced with permission from American 
Chemical Society) 


is the only element which satisfies the criteria of 
both ionic radius and low valence number. 
La a8 Sr 0 2 Ga a8 Mg 0 2 0 2 .6 (LSGM) shows the same level 
of ionic conductivity as GDC at temperatures above 
650° C. Electronic conductivity does not become pre¬ 
dominant within a wide range of atmospheres from 
pure oxygen to hydrogen-containing gases [84, 85]. 

While LSGM exhibits superior properties as an 
SOFC solid electrolyte, a phase transition from pseudo 
cubic (monoclinic) to tetragonal was observed at 
temperatures around 700°C. In order to give high 
ionic conductivity to this material even below 
650°C, the B site was partially substituted with Co, 
which has a similar ionic radius to Mg (La 0 8 Sr 0-2 Ga 0 . 8 
Mg 0 .ii5Coo.o850 3 _5, denoted as LSGMC) [84]. The 
crystalline phase at high temperature was then stabi¬ 
lized, and the phase transition did not occur at tem¬ 
peratures around 700° C, resulting in higher ionic 
conductivity than that of GDC within a wide temper¬ 
ature range from 500°C to 1,000°C [85] . It was revealed 
that oxygen ions tend to be trapped around Mg in 


LSGM [86], and the addition of Co on the B site is 
effective for preventing such trapping at lower 
temperatures. 

When an LaGa0 3 -based oxide is applied as a solid 
electrolyte for SOFCs, reactivity and/or interdiffusion 
with electrode materials becomes a critical issue. When 
LSM was used as the cathode material, interdiffusion of 
Mn into LSGM was observed [87]. When LSCF was 
used as the cathode material, interdiffusion of Co and 
Fe into LSGM was also seen [88-90]. A solid-state 
reaction also occurs at the interface between the Ni- 
based anode and the LSGM electrolyte due to the 
similarity of Ni to Co and Mn in the electrolyte mate¬ 
rials [91]. There have been some attempts to prevent 
such solid-state reactions, for example, by optimizing 
deposition procedures and by applying a Ce0 2 
interlayer between the LSC cathode and the LSGM 
electrolyte. 

Ba 2 ln 2 0 5 -Based Oxides The brownmillerite-type 
oxide has the general formula A 2 B 2 0 5 , with the 
crystal structure presented in Fig. 6. Alkaline-earth 
metal ions such as Ba occupy the A site, and trivalent 
typical metal ions such as In occupy the B site [92-99]. 
At temperatures below 1,140 K, it behaves as a mixed 
conductor which shows both p-type electronic 
conductivity and oxygen ionic conductivity due to its 
oxygen excess [92]. When it is heated to 1,140-1,230 K, 
ionic conductivity increases, which is associated with 
the phase transition to the disordered perovskite 
structure, as shown in Fig. 7 [93]. The ionic 
transference number also rises drastically to unity. 

For example, in the case of Ba 2 In 2 0 5 , the typical 
brownmillerite-type oxide, the disordered perovskite 
structure can be stabilized, and ionic conductivity in 
an intermediate-temperature range can be improved by 
substituting In with another highly charged cation such 
as Zr, Hf, and Sn [94, 95]. A similar effect can be 
obtained by the substitution of Ba, as shown in Fig. 8 
[93, 96]. 

Ba 2 In 2 0 5 is stable in the absence of C0 2 but decom¬ 
poses in the presence of C0 2 in a reducing atmosphere 
(p (0 2 ) = 10 _9 -10 -13 atm), accompanied by the for¬ 
mation of BaC0 3 . However, in a strongly reducing 
atmosphere of much lower oxygen partial pressure, 
In 3+ is reduced to In + , and the brownmillerite structure 
cannot be maintained anymore [92, 95, 97-99]. 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 6 

Crystal structure of Ba 2 ln 2 0 5 {Green: Ba, Blue: In, Red: O) 


Apatite-Type Oxides 

General Apatite is a generic name for minerals with 
the composition A 10 (MO 4 ) 6 X 2 [100-102]. Generally, 
the A site can be occupied by alkaline, alkaline-earth, 
or rare-earth metal ions. The M site can be occupied by 
elements such as P, Si, and Ge which can have 
a coordination number of 4. The M site can be 
partially substituted with transition metal ions. The 
X site can be occupied by O, OH, F, Cl, and C0 3 , 
etc. Moreover, the A-site cations are divided into two 
kinds: one occupies the 4f site where the coordination 
number is 7, and the other occupies the 6h site where 
the coordination number is 9. Hydroxide apatite 
Ca 10 (PO 4 ) 6 (OH) is a typical example. It is the 
principal ingredient of human bone and is practically 
used for artificial bone [101-103]. 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 7 

Ionic conductivity of Ba 2 ln 2 0 5 , reported by Goodenough 
[93] (Reproduced with permission from Elsevier) 


Ld 9 . 33 +x^ 6 ^ 26 + 3 x /2 ( x — 0-0.67) La 9-33 Si 6 0 26 and 

La 10 Si 6 O 2 7 have often been studied for their 
crystallographic interest. It is already known that 
lattice volume will continuously decrease with 
the ionic radius of lanthanide ions from La to Lu 
[102, 104]. In the Re 9 33+x Si 6 0 26 system, oxides with 
Re = La-Dy belong to the hexagonal system apatite 
structure (P6 3 /m) and oxides with Re = Y-Yb to the 
monoclinic system apatite structure (I2/a). Specifically, 
LaioSi 6 0 27 is not regarded as the apatite phase. In the 
case of silicate and germanate, the oxides with 
the amount of La from 8 to 9.33 exhibit the apatite 
phase [105]. 

The crystal structure of this system is shown in 
Fig. 9. The La 6h site is fully occupied, while the La 4f 
site is not fully filled. La 9 33 Si 6 0 26 does not have 


interstitial oxygen ions, but the conductivity improves 
by partially substituting the La 4f site or by preparing 
a slightly La-deficient sample (La = 9.0) to introduce 
oxygen vacancies into the 2a site [ 105, 106]. For x > 0, 
oxygen ions are introduced into the interstitial site, 
and 1 mol of oxygen is introduced into the unit cell at 
La = 10 (x = 0.67). When the number of interstitial 
oxygen ions increases, the conductivity also increases 
[107]. The ionic conductivity in this system is strongly 
anisotropic, as its crystallographic structure is. A sin¬ 
gle-crystal conductivity measurement of Nd 9>33 Si 6 0 26 
has revealed that the conductivity along the c-axis is 
more than 10 times higher than that in the a-b plane 
[ 108] . The anisotropy of this conductivity disappears at 
around 800°C [109]. 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 8 

Comparison of the oxygen ionic conductivity of 
La 0 . 4 Ba 0 . 6 lnO 2 .7 with Lao. 9 Sro. 1 Gao. 8 Mgo. 2 O 2.85 and YSZ, 
reported by Goodenough [93] (Reproduced with 
permission from Elsevier) 

As solid electrolytes, the La/Si oxides with La > 9.33 
contain interstitial oxygen ions, which can conduct in 
the channel (2a site) along the c-axis. The conduction 
mechanism is quite different from that of other oxide 
electrolytes in which oxygen vacancies are the mobile 
charge carriers. For example, in La 10 Si 6 O 27 , the activa¬ 
tion energy of ionic conductivity via oxygen intersti¬ 
tials is lower than that of YSZ, and the ionic 
conductivity of this La/Si oxide is higher than that of 
YSZ at and below 923 K, as described in Fig. 10 [110]. 
Moreover, it is known that the transference number is 
unity, and that this material remains stable in humid¬ 
ified hydrogen and in C0 2 -containing atmospheres at 
and below 1,073 K, while p-type conduction can appear 
at and above 1,173 K [111, 112]. The ionic conductivity 
decreases, and its activation energy increases with 
decreasing ionic radius of the rare-earth metal ions in 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 9 

Crystal structure of La 10 Si 6 O 27 [ 110 ] {Green: La, Blue: Si, 
Red: O) (Reproduced with permission from Elsevier) 


the La site [113, 114]. The substitution sites of dopants 
should be taken into account in considering ionic con¬ 
ductivities of such materials [106, 107, 110, 115, 116]. 

These materials are based on inexpensive La 2 0 3 and 
Si0 2 and can be synthesized at relatively low tempera¬ 
tures, but cannot exhibit high conductivity and 
density without sintering at and above 1,873 K [117]. 
The thermal expansion coefficient of La 10 Si 6 O 27 is 
9.7-10.3 x 10~ 6 K _1 . This value is very close to that 
of YSZ (10.5 x 10“ 6 K -1 ) and GDC (11.8 x 10~ 6 K _1 ) 
[118, 119], indicating that similar oxide electrodes 
may be used. There are a few reports on fuel cell 
electrochemical performance using a La 0 . 9 Sr ai CoO 3 _5 
cathode with a Ni-SDC cermet anode [110, 116, 119]. 
The results show 50% better performance than that 
with Pt electrodes [119]. The interdiffusion of La and 
Sr in the electrode and in the electrolyte can be tech¬ 
nological issues, as observed also in the LSGM-electro- 
lyte-based cells. 

La 933+x Ge 6 0 2 6+3x/2 ( x = 0-2.67) It has been already 
reported that La/Ge oxides exhibit a phase transition 
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Comparison of the oxygen ionic conductivity of lanthanum 
silicate with Lao. 9 Sro.TGao. 9 Mgo. 1 O 3 and oxides with the 
fluorite structure [ 110 ] 
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from triclinic to hexagonal phase at around 1,000 K 
[120]. This phase transition disappears by 
substitution of La by Sr [121]. A similar phase 
transition is observed in La 2 Ge0 5 [122]. The ionic 
conductivity of La x Ge 6 0 12 + i. 5 X exhibits the highest 
value in the apatite region, while this result is not the 
same as in the lanthanum silicates [105, 119, 123]. Due 
to the volatility of Ge, La 2 Ge0 5 is difficult to synthesize 
in the stoichiometric ratio of La/Ge [124]. 

Protonic Conductors 

General In general, most ionic conductors currently 
used for solid electrolytes in SOFCs are oxygen ionic 
conductors. However, the use of proton-conducting 
oxides allowing for lower-temperature operation is 
anticipated to have a beneficial effect on durability at 
long-term operation. There are at least two approaches 
for this aim from the electrolyte materials selection 
viewpoint. One approach is to increase the oxygen 
ionic conductivity of the electrolytes, and another 
approach is to use a proton-conducting oxide which 
exhibits smaller temperature dependence (i.e., lower 
activation energy) of the ionic conductivity. 


There are many kinds of proton-conducting oxides 
widely known in the literature, for example, by Kreuer 
[125]. In a very early study, C. Wagner had already 
reported the presence of protons in stabilized Zr0 2 
[ 126] . Hydrated oxides such as W0 3 -nH 2 0, phosphates 
such as Zr(HP0 4 ) 2 , [3-alumina, P^-alumina, zeolites, 
and heteropoly acid are good examples of proton¬ 
conducting oxides [9, 125, 127-130]. Acid salts 
such as CsHS 0 4 are also considered as electrolyte 
materials for fuel cells operating at lower temperatures 
[131, 132]. 

Various perovskite-type oxides [133,134], as well as 
Ba 2 In 2 0 5 [135-137], and La 10 Ge 6 O 27 [120, 138, 139] 
exhibit protonic conductivity. Among these materials, 
the perovskite-type oxides show outstanding protonic 
conductivity. To mention a few, SOFCs with 
BaCe 0 . 8 Yo. 20 3 _5 solid electrolytes can be operated at 
low temperatures above 400°C [140, 141], and a power 
density of 100 mW/cm 2 at 500°C has been achieved 
(Fig. 11, [141]). 

Perovskite-Type Oxides Regarding protonic con¬ 
duction in perovskite-type oxides (AB0 3 ), Takahashi 
and Iwahara [81] discovered the interesting phenome¬ 
non that protonic conduction appeared in SrCe0 3 at 
high temperatures, which was associated with the pres¬ 
ence of oxygen vacancies [125]. They revealed that 
BaCe0 3 has high hydrogen solubility by using deute¬ 
rium absorption measurements [142]. This material 
exhibited mixed conductivity of oxygen ions and pro¬ 
tons, which was strongly affected by the Ba/Ce ratio 
[143, 144]. When the Ba/Ce ratio was more than 1.05, 
such oxides exhibited high protonic conductivity. 
When the Ba/Ce ratio was less than 1, it behaved as an 
oxygen ionic conductor, although the conductivity was 
relatively low [ 145] . They also investigated the effect of 
partial substitution in detail [146-153]. 

The most important reaction leading to the forma¬ 
tion of protonic defects at moderate temperatures is the 
dissociative absorption of water which requires the 
presence of oxide ion vacancies Vq . Those defects 
may be formed intrinsically by varying the ratio of 
main constituents (e.g., in Ba 3 Ca 1+x Nb 2 _ x 0 3 _5 [154] 
or Ba 2 YSn0 5 . 5 [155]), or they may be formed extrinsi- 
cally to compensate for an acceptor dopant R. In the 
case of perovskite-type oxides, one generally employs 
a substitution of up to 25% of the B-site cation by 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 11 

(a) l-V characteristics (closed symbols) and power density curves (open symbols), (b) ohmic loss and (c) electrode 
overpotentials (anode + cathode) of a H 2 -0 2 proton conductor thin film SOFC operated at 700-500°C, reported by 
Matsumoto et al. [141] (Reproduced by permission of Elsevier) 


a lower-valence cation (i.e., A(B 1 _ x R x )0 3 _§ in the fol¬ 
lowing denoted R:AB0 3 ). In order to form protonic 
defects, water from the gas phase dissociates into 
a hydroxide ion filling an oxide ion vacancy and 
a proton forming a covalent bond with a lattice oxygen. 
In Kroger-Vink notation, this reaction is written: 

H 2 0 + Vq + 0q^20Hq (1) 

by which two hydroxide ions substituting for oxide 
ions, that is, two positively charged protonic defects 
(OHq), are formed. The hydroxyl ions form dynamical 


hydrogen bonds with neighboring lattice oxygen, and 
proton transfer through these hydrogen bonds com¬ 
bined with the rotational diffusion of this defect allows 
the protonic charge to migrate [156]. 

For perovskite-type oxides, the proton solubility 
decreases, but the stability to C0 2 increases as the 
basicity of B-site ions decreases in the order of Ce —> 
Zr —> Sn —> Nb —> Ti [156]. The best compromise for 
high proton conductivity and chemical stability is 
found for Y-doped BaZr0 3 , but unfortunately, this 
material exhibits large grain boundary resistance, 
and it still reacts slightly with C0 2 under certain 
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conditions [157]. Chemical stability against C0 2 has 
been demonstrated recently with BaCe 0 3 Zr 0 5 Y 0 2 0 3 _§ 
by Fabbri et al. [158]. 

Proton concentration is proportional to the solu¬ 
bility of water into oxygen vacancies. Such a protonic 
conductor tends to lose water at high temperatures, 
which results in a decrease in the carrier density and 
also a decrease in conductivity. Figure 12 shows the 
amount of water dissolution measured by the gravi¬ 
metric analysis for protonic conductors [156]. The 
perovskite-type proton-conductive oxides have enough 
carrier density in the temperature range from ca. 300 to 
500° C. On the other hand, mobility changes with tem¬ 
perature according to the Arrhenius equation with 
activation energy approximately from 0.3 eV to 
0.6 eV. As a result, these protonic conductors exhibit 
their maximum conductivity at certain temperatures as 
shown in Fig. 13 [156]. 

It is extremely important that the ionic transference 
number is high enough to be used as a solid electrolyte 
in fuel cells. The Ce 4+ cation has a tendency to be 
reduced easily, so that electronic conduction caused 
by the reduction of Ce 4+ ions becomes a problem in 
the case of the Ce0 2 -containing electrolyte. BaCe0 3 - 
based oxides contain Ce ions, exhibiting certain 


electronic conductivity by reduction [157]. Transport 
parameters including activation enthalpies of hole and 
proton conduction have been reported [157, 159]. 

It is commonly believed that this is because the 
band gaps of the perovskite-type protonic conductors 
are considerably large, ranging from 4.5 to 5 eV or even 
larger [160]. On the other hand, hole conduction 
appears in an oxidizing atmosphere. However, the tem¬ 
perature dependence of the hole conductivity is larger 
than that of the protonic conductivity. Oxygen vacan¬ 
cies are filled with water dissolved in various oxides, as 
shown in Fig. 12, at temperatures below 500°C, leading 
to a high protonic conductivity in an oxidizing 
atmosphere. 

Alternative Electrode Materials 

Cathode Materials 

Current Cathode Materials The majority of investi¬ 
gations related to SOFC cathode materials have 
been concentrated on oxides with the perovskite 
structure [2-4, 161]. The oxides commonly used are 
SrO-doped LaMn0 3 with an A-site deficiency 
((Lai-xSrJ^yMnO^s; x = 0.1-0.5, 0 < y < 0.1). 
For example, La 0 86 Sr 0 14 Mn0 3 exhibits an electronic 




Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 12 

Normalized hydrogen isobars (pH 2 0 = 23 hPa) for different perovskites, reported by Kreuer [156]. For BaZr0 3 -based 
compositions, data for different hypovalent dopants (Y, Sc, Gd, In ) are included (Reproduced by permission of Annual 
Reviews) 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 13 

Ionic conductivity of proton-conductive perovskite type oxides, reported by Kreuer [156] (Reproduced by permission of 
Annual Reviews) 


conductivity of 1.4 x 10 4 S/m at 1,000°C [162]. The 
electronic conductivity increases with increasing alka¬ 
line-earth cation (Sr 2+ ) concentration up to x « 0.5. 
The increase in Sr content, however, is accompanied 
with an increase in thermal expansion coefficient [ 161— 
168]. The perovskite structure is very tolerant with 
changes in the ionic radius of the A- and B-site cations, 
allowing many possibilities to substitute host cations 
with various dopants. Related perovskite oxides have 
also been investigated. The A site in this perovskite can 
be substituted with other rare-earth oxides (e.g., 
Y^xCaxMnOa) [169-172] and with other alkaline- 
earth oxides (La^xCaxMnOa) [173-175]. This struc¬ 
ture can also accommodate a high concentration of 
cation vacancies, particularly on the A site as 
(La 1 _ x Srx)i_yMn0 3 [22, 176-179]. By substituting 


the B site, the electronic conductivity and other prop¬ 
erties such as nonstoichiometry, oxygen vacancy con¬ 
centration, and catalytic activity can be changed 
considerably [163, 180, 181]. The Mn cation has been 
substituted with Fe, Co, Ni, and Cu, which enhances 
the electronic conductivity [180, 182] . Transition metal 
cations can be substituted on the B site like in 
La 1 _ x Sr x Mn 1 _ y COy0 3 or in La 1 _xSr x Fe 1 _yCOy0 3 , 
exhibiting excellent ionic conductivity as well as high 
electronic conductivity [21, 182-190]. The electronic 
conductivity of several perovskite oxides is shown in 
Fig. 14 [180]. Those compounds may be favored for 
their high electronic conductivity, particularly for pla¬ 
nar fuel cell configurations, where in-plane electric 
current flow is essential in the relatively thin cathode 
layers. In recent SOFC systems under development, 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 14 

Electronic conductivity of SrO-doped LaB0 3 (B = Mn, Co, Fe) after Takeda et al. [1 80] (Reproduced by permission of ECS- 
The Electrochemical Society) 


Sr-doped LaMn0 3 is still widely used as the state-of- 
the-art cathode material for Zr0 2 -based electrolytes 
[2-4, 161, 191]. 

In addition, cathode materials with mixed ionic and 
electronic conductivities can expand the electrode reac¬ 
tion area, leading to a decrease in cathodic overvoltage 
[2, 161, 192]. The perovskite oxides exhibit excellent 
electronic conductivity, and in specific compounds and 
conditions, they possess high ionic conductivity. The 
ionic conductivity of La a6 Sr 04 MnO 3 measured by 
Steele et al. [193] was 10 -5 S/m at 900°C in air, which 
is 6 orders of magnitude lower than the ionic conduc¬ 
tivity of 8 mol% Y 2 0 3 -doped Zr0 2 electrolyte. There¬ 
fore, the SrO-doped LaMn0 3 , the state-of-the-art 
cathode material, is a poor mixed conductor [193]. 
However, the oxide becomes a mixed conductor 
in reduced state (at low oxygen partial pressure) 


[192, 194], as expected from the defect chemistry 
[22, 163] and electrocatalytic properties [195]. 

On the other hand, SrO- or CaO-doped LaCo0 3 
and LaFe0 3 are excellent mixed conductors in ambient 
atmosphere [22, 180, 186-189, 193], where ionic con¬ 
duction occurs via the oxygen vacancy mechanism 
[196]. Teraoka et al. [188] have reported that the 
La 1 _ x Sr x Co 1 _ y Fe y 0 3 _§ exhibits ionic conductivity 
ranging from 10 to 10 3 S/m, while the ionic conductiv¬ 
ity of doped Zr0 2 electrolyte is around 10 S/m. High 
ionic conductivity in La 0 .6Sr a4 CoO 3 has been con¬ 
firmed by Steele et al. to be 80 S/m at 900° C [193]. 
Ionic conductivity also increases by substituting the 
B-site cations with cations with a larger number of 3d 
electrons from Fe to Co and Ni [182], consistent with 
the increase in oxygen vacancy concentration [163]. 
The thermal expansion mismatch between the cathode 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Table 2 Thermal expansion coeffi¬ 
cients of perovskite oxides for SOFC cathode applications 
[33, 168, 184, 186] 


Materials 

Thermal 
expansion 
coefficient 
(x 10 6 /K) 

Temp, 
range (°C) 

Zr0 2 

Zr0 2 -8 mol%Y 2 0 3 

10.5 

25-900 

LSM 

Lao.85Sr 0 .i5Mn0 3 

11.7 

25-900 


Lao.5Sr 0 .5Mn0 3 

12.6 

25-900 


La 0 ggMn0 3 

11.2 a 

25-1,000 


La 0 . 94 Sr 0 . 05 MnO 3 

11. T 

25-1,000 


La 0 . 89 Sr 0 .i oMn 0 3 

12 . 0 a 

25-1,000 


La 0 .79Sr 0 .2oMn0 3 

12.4 a 

25-1,000 


La 0 .69Sr 0 .3oMn0 3 

12 . 8 a 

25-1,000 

LSC 

La 0 . 85 Sr 0 .i 5 CoO 3 

18.7 

25-900 


La 0 .7Sro.3Co0 3 

18.3 

25-900 


La 0 .5Sr 0 .5CoO 3 

22.0 

25-900 


La 0 . 3 Sro. 7 Co0 3 

15.6 

25-900 

LSCM 

La 0 . 85 Sr 0 .i 5Mno.8Coo.2O5 

11.8 

25-900 


La 0 . 85 Sr 0 .i 5 Mno. 5 Coo. 5 O 5 

14.6 

25-900 


Lao. 5 Sro. 5 Mno. 5 Coo. 5 O 5 

14.9 

25-900 


Lao.8Sro.2Mno.8Coo.2O5 

n.i b 

200-800 


Lao.8Sro.2Mno.6Coo.4O5 

13.7 b 

200-800 


Lao.8Sro.2Mno.4Coo.6O5 

16.5 b 

200-800 


Lao.8Sro.2Mno.2Coo.8O5 

17.0 b 

200-800 


La 0 .8Sr 0 .2CoO 3 

17.8 b 

200-800 

LSCF 

Lao.8Sro.2Coo.8Feo.2O5 

18.4 

25-900 


La0.2Sr0.8Co0.8Fe0.2O5 

24.0 

25-900 


La 0 .8Sr 0 .2CoO 3 

19.7 C 

100 - 1,000 


Lao.8Sro.2Coo.8Feo.2O5 

20.7 C 

100 - 1,000 


Lao.8Sro.2Coo.6Feo.4O5 

20 . 0 C 

100 - 1,000 


La0.8Sr0.2Co0.4Fe0.6O5 

17.7 C 

100 - 1,000 


La0.8Sr0.2Co0.2Fe0.8O5 

15.1 c 

100 - 1,000 


La 0 .8Sr 0 .2FeO 3 

12.9 C 

100 - 1,000 


a See Ref. [168] 
b See Ref. [184] 
c See Ref. [186] 


and the electrolyte, as compiled in Table 2, leads to 
insufficient contact and thus the detachment of cath¬ 
ode layers, which limits the SOFC applications of such 
mixed conducting cathode materials. 

Alternative Cathode Materials Perovskite oxides 
using smaller lanthanide elements than La, such as 
Gd!_ x Sr x Co0 3 [197], Sm^xSrxCoC^ [198], 
Pr^xS^MnOa, and Nd!_ x Sr x Mn0 3 [199] have been 
investigated as cathode materials in order to decrease 
reactivity with the YSZ electrolyte. Compatibility with 
YSZ electrolyte was further improved for 
Gd 1 _ x Sr x Co 1 _ y Mn y 0 3 [200] by substituting Co with 
Mn on the B site due to lower reactivity and matching 
of thermal expansion coefficients. Substitution of Co 
with Fe on the B site also decreases their thermal 
expansion coefficient. Perovskite oxides using Co and 
Fe for the B site, such as Pr 1 _ x Sr x Co 1 _ y Fe y 0 3 _§, 
Nd 1 _ x Sr x Co 1 _ y Fe y 0 3 _§, and Gd 1 _ x Sr x Co 1 _ y Fe y 0 3 , 
exhibit high electrical conductivity above 600° C as 
shown in Fig. 15 and exhibit good catalytic activity on 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 15 

Temperature dependence of the electrical conductivity for 
Ln 0 . 4 Sro. 6 Coo. 8 Feo. 20 3 _§ (Ln = La, Pr, Nd, Sm, Gd), reported 
by Tu et al. [201] (Reproduced with permission from 
Elsevier) 
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GDC electrolytes in the intermediate operational tem¬ 
perature range (600-800°C) [201, 202], as shown in 
Fig. 16. 

Sm a 5 Sr a 5 Co 03 has been used as a cathode material 
for the LSGM electrolyte, and a low cathodic 
overpotential of less than 50 mV at 0.5 A/cm 2 was 
obtained at 873 K [203]. BaCo0 3 was also investigated 
as a novel cathode material for the LSGM electrolyte. 
Ba 0 . 6 Lao. 4 Co 0 3 showed a high surface activity for oxy¬ 
gen dissociation and exhibited the same level of 
cathodic overpotential as Sm 0 5 Sr 0 5 Co0 3 as shown in 
Fig. 17 [204]. 


Perovskite oxides containing Ba in the A site were 
proposed as cathode materials for a lower-temperature 
operation. Many of them have been investigated in 
combination with Ce0 2 -based electrolytes. High 
power density of more than 1 W/cm 2 was obtained at 
600°C by using Ba 0 . 5 Sr 0 5 Co 0 . 8 Fe 0 . 2 O 3 _§ (BSCF) for the 
cathode on a thin SDC electrolyte (12-20 pm in thick¬ 
ness) as shown in Fig. 18 [205]. Enhancement of elec¬ 
trical conductivity by an addition of Sm to BSCF was 
also reported, which resulted in further improvement 
of cathode performance at low operational tempera¬ 
tures [206]. Ba 05 Sr 0 5 Co 1 _ y Fe y 0 3 _§ possessed low 




C log I/A cm 2 

Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 16 

Cathodic polarization curves for Ln 0 .8Sro.2Co 1 _ y Fe y 0 3 _5 ((a) Ln = Pr, (b) Ln = Nd, (c) Ln = Gd) cathodes on GDC electrolytes 
at 700°C, reported by Qiu et al. [202] (Reproduced with permission from Elsevier) 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 17 

Temperature dependence of cathodic overpotential of the 
Ba 0 .6Lao.4Co0 3 cathode on the LSGMC electrolyte as 
a function of current density, reported by Ishihara et al. 
[204] (Reproduced with permission from Elsevier) 


vacancy formation energy with low oxygen migration 
barrier [207]. However, BSCF cathode could degradate 
in the presence of C0 2 in ambient air, owing to the 
reaction with alkaline-earth elements, resulting in the 
formation of Ba and Sr carbonates [208, 209]. 

Regarding durability of SOFC cathodes, Cr poison¬ 
ing phenomena of cathode materials have been inves¬ 
tigated to better understand long-term cell 
performance. Taniguchi et al. clarified that not only 
the vapor pressure of Cr species but also the cathodic 
overvoltage affects the degradation rate for a cell using 
La 0 > 9 Sr 0 !Mn0 3 [210]. In addition, reaction between 
the Cr vapor and the cathode material can decrease 
the vapor pressure and contribute in delaying the 
beginning of cell performance degradation [ 211 ]. 
Matsuzaki et al. reported that the use of 
Lao. 6 Sro. 4 Coo. 2 Feo. 3 O 3 (LSCF) on Ceo.gSmo^Oi.g elec¬ 
trolyte drastically reduced the Cr poisoning in compar¬ 
ison with La 0 . 85 Sr 0 .i 5 MnO 3 [212]. LaNi 0 . 6 Fe 0 . 4 O 3 
(LNF) was also proposed as a novel cathode material, 
exhibiting high tolerance to the Cr poisoning. Chiba 
et al. have first reported that LNF possesses suitable 
properties for operating at low temperatures [213]. 
LNF has shown high electronic conductivity of 




Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 18 

Performance obtained from a Ba 0 .5Sro.5Coo.8Feo.20 3 _5(~20 |im)| Sm 0 .i5Ce 0 .85O 2 _5(~20 pm)| 

Ni + Sm 0 .i5Ce 0 .85O 2 -§(^700 pm) fuel cell, (a) Cell voltage and power density as a function of current density and (b) 
area-specific resistances of the electrodes, reported by Shao and Haile [205] (Reproduced with permission from Nature 
Publishing Group) 
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580 S/cm at 800° C and relatively closer thermal expan¬ 
sion coefficient of 11.4 x 10~ 6 K -1 (30-l,000°C) to 
that of YSZ than Lao. 8 Sr 0-2 Mn0 3 (LSM). Furthermore, 
it is also revealed that LNF has higher tolerance to Cr 
poisoning than LSM and LSCF [214]. Figure 19 shows 
the comparison of cathodic overvoltage by Cr poison¬ 
ing between LSCF and LNF. Concerning the Cr poi¬ 
soning phenomena, Jiang et al. [215] explained that the 
deposition of Cr in the LSM cathode is initiated by the 
nucleation reaction involving the gaseous Cr species 
and Mn 2+ , which are generated by the cathode polari¬ 
zation and regarded as nucleation agents. SrO is iden¬ 
tified as a nucleation agent in the case of LSCF. As 
a result, they concluded that LNF exhibits high toler¬ 
ance to Cr poisoning because it has no nucleation agent 
under any electrochemical condition [215]. In terms of 
electrochemical properties at low temperatures, an 
electrode resistance of approximately 0.1 Q cm 2 , 
corresponding to the cathode polarization, was 
achieved at 700° C in the case of a LNF-YSZ composite 
cathode which was made by infiltrating LNF into 
porous YSZ [216]. 

Another type of oxide with the general formulation 
of A 2 B0 4+ § (e.g., La 2 _ x Sr x Ni0 4+ § [217], Nd 2 Ni0 4+ § 
[218, 219], Pr : 4 Sr 0-6 NiO 4 [220], and Sm 0-5 Sr L5 CoO 4 _§ 
[221]) has also been investigated for intermediate- 
temperature (900-1,000 K) operation. Their K 2 NiF 4 - 
type structure allows for some slight oxygen 


overstoichiometry. Oxygen excess, accommodated by 
oxygen interstitials rather than cation vacancies, leads 
to high values of oxygen tracer diffusion coefficient. 
Current density of 0.5 A cm -2 at 0.7 V was obtained at 
700° C with an anode-supported half-cell with 
a Nd L95 Ni0 4+ § cathode and a YSZ electrolyte [219]. 

In order to enhance the catalytic activity of the 
cathode for oxygen reduction and thus to increase the 
electrochemical performance, an addition of noble 
metals has also been investigated [222] . By the addition 
of silver to a LSCF-Ce0 2 composite cathode, an 
improvement of cathode performance at lower temper¬ 
atures below 700°C was measured [223, 224]. Conse¬ 
quently, Ag improved the surface exchange step of 
oxygen from the gas phase into the oxide [223]. Cell 
performance of more than 500 mW/cm 2 at 0.7 V at 
700°C was obtained using anode-supported cells with 
LSCF-coated Ag cathode and YSZ electrolyte [225]. 
Generally speaking, Sr-doped LaMn0 3 and related 
materials are suitable for high-temperature operation 
of SOFCs, while more electrochemically/catalytically 
active alternative cathode materials may be needed 
especially for lower operational temperatures. 



Anode Materials 

Current Anode Materials In state-of-the-art SOFCs, 
Ni-based cermet anodes are commonly used, which 




Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 19 

Cathodic overvoltage (IR corrected) for cells using (a) LSCF cathode and (b) LNF cathode as a function of time: (•) cell with 
Inconel 600 at 0.7 A crrr 2 and (■) cell without Inconel 600 at 0.7 A cirT 2 , reported by Komatsu et al. [214] (Reproduced by 
permission of Elsevier) 
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exhibit excellent electrocatalytic activity for anodic 
electrochemical reactions, fuel oxidation, and internal 
reforming, as well as sufficient current collection [2-4, 
161, 226]. Due to the requirements of operation in 
reducing environments and high electronic conductiv¬ 
ity, pure metallic electrodes are generally used as 
anodes. Several metals such as Ni, Ru, Pt, and Co 
have been studied as anode materials [161, 227, 228]. 
Ni exhibited the lowest anodic overpotentials 
[161, 227], while Ru could be an excellent anode mate¬ 
rial [228]. Pure Ni has electronic conductivities of 1.4 
x 10 5 Scm -1 and 2.6 x 10 4 Scm -1 at 298 K and 900 K, 
respectively; a thermal expansion coefficient of ca. 14.6 
x 10 6 K -1 ; and a melting point of 1,455°C [229]. The 
transition metals, which show higher activities for the 
oxidation of hydrogen, tend to change their micro- 
structures at high temperatures under the reducing 
conditions of the anode, that is, coalescence of metal 
particles leads to anode densification. To stabilize the 
anode morphology in terms of porosity and thermal 
expansion coefficients, in general, cermets are used, 
where the ceramic component provides a rigid network 
and hinders the porous anode morphology from coa¬ 
lescence. The ceramic component also ensures to make 
the thermal expansion coefficient similar to that of the 
electrolyte materials. 

Internal reforming of hydrocarbon-based fuels can 
also occur on the Ni surface in the cermet anode. In an 
SOFC cermet anode, substantially long triple phase 
boundaries (TPB) can be achieved, and given that solid 
networks of two phases (electronic conductors and ionic 
conductors) are obtained throughout the anode volume. 
It has been well accepted that the electrochemical 
reaction occurs around the TPB, where oxygen ion con¬ 
ductor (electrolyte), electron-conducting metal phase 
(Ni), and gas phase all meet together [2, 161]. 

Alternative Anode Materials 

General While Ni/YSZ cermet is the most preferred 
anode material for SOFCs operating with hydrogen- 
containing fuels, there are several efforts to find 
alternative materials for the SOFC anode. Many 
studies on oxide anodes have been made for 
perovskite-related oxides. LaCr0 3 [230-237], LaFe0 3 
[238],LaTi0 3 [239], and SrTi0 3 [240-243] systems are 
considered as possible alternative anode materials. 


Perovskite-Type Oxide LaCr0 3 -based oxides are well 
known as interconnect materials for SOFCs. They have 
high durability in reducing atmosphere. The oxygen- 
deficient perovskite La 0 . 7 5 Sr 0 2 sCr 0 5 Mn 0 5 0 3 , used 
as an anode material by Tao et al. [230], is stable in 
both fuel and air atmospheres and has revealed 
stable electrode performance, as shown in Fig. 20 

[230] . However, this material has a low electronic 
conductivity in reducing anode atmosphere and is not 
stable to sulfur impurities in the fuel [231]. Its long¬ 
term performance has been proven unsatisfactory 

[231] . Sr 2 MgM 0 O 6 _ 8 exhibited a Sulfur-tolerance 
up to 50 ppm H 2 S [232]. Experimental results on 
the La 0 . 8 Sr 0-2 Cr 0-9 7 V 0 . 0 3 O 3 (LSCV)-YSZ composite 
anode have demonstrated that the electrochemical 
performance is comparable to that of Ni/YSZ after 
48 h operation [233]. 

Ruiz-Morales et al. [234] demonstrated symmetri¬ 
cal fuel cells (SFCs) using the same material 
L a o.75Sro.25Cro.5Mn 0 .50 3 _§ (LSCN1) on both the 
anode side and the cathode side. Due to its enhanced 
electrochemical properties under both reducing and 
oxidizing conditions, LSCM-based SFCs offered suffi¬ 
cient performances, 0.5 and 0.3 Wcm -1 at 950°C using 
H 2 and CH 4 as fuels, respectively [234]. 



Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 20 

Temperature (7) dependence of total conductivity (a) of 
Lao.75Sr 0 .25Cro.5Mno.50 3 _§ in air and 5% H 2/ reported by Tao 
and Irvine [230] (Reproduced with permission from Nature 
Publishing Group) 
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Sauvet et al. [235] have studied the electrochemical 
properties of La 1 _ x Sr x Cr 1 _ y Ru y 0 3 _§ (x = 0.2—0.4, and 
y = 0.02 and 0.05) for hydrogen and methane fuels at 
750 and 850° C. They found the best performances 
obtained with 30% of Sr doping, and the variation of 
the ruthenium content, approximately between 0.02 
and 0.05, had little influence on the electrochemical 
results. At 750°C, the polarization resistance was 
3.7 Q cm 2 in hydrogen and 40 Q cm 2 in methane for 
the YSZ/Ce0 2 /graded La 0 . 7 Sr 0>3 Cr 0- 95 Ru 0 . 05 O 3 -YSZ 
structure [235]. Sauvet et al. also studied 
La 1 _ x Sr x Cr 1 _ y Ni y 0 3 _§ compositions as novel anode 
materials. They have found that these materials have 
catalytic activity, but less than that of Ni composites 
[236, 237]. 

Lanthanum ferrites have also been investigated as 
potential anode materials. They are already known for 
high mixed electronic and ionic conductivity and good 
oxygen exchange kinetics. However, they have poor 
stability at lower oxygen partial pressures in the reduc¬ 
ing atmosphere. The stability, however, could be 
improved by mixing with GDC [238]. 

(La, Sr)Ti0 3 has been investigated as alternative 
SOFC anodes, along with other A-site-deficient lantha¬ 
nides. Nevertheless, it has poor conductivity than 
Ni/YSZ cermet [239]. Doped SrTi0 3 could be an inter¬ 
esting anode material, and the mixed conductivity of 
La-doped and Fe-doped SrTi0 3 is widely influenced by 
the chemical composition [240, 244, 245]. SrTi0 3 
doped with Y [241] and La [242] exhibits increased 
electrical conductivity. Unfortunately, the high- 
temperature electrical conductivity of various perov- 
skite anodes is still lower than that of the conventional 
Ni/YSZ cermet for the use as SOFC anodes [243]. 

Other Transition Metal Cermets The Cu-based anode 
cermet has been extensively studied. For example, Park 
et al. [246, 247] demonstrated that the anode 
composed of Cu/Sm 2 0 3 -Ce0 2 /YSZ was active for 
direct electrochemical oxidation of alkanes and 
alkenes, such as methane, ethane, butane, butene, 
hexane, and toluene, at the operating temperature of 
700° C. Ce0 2 plays an important role as a primary 
catalyst material for oxidizing hydrocarbons, and the 
main role of Cu is to provide electrical conductivity. 
Zhou et al. [248] reported good performance of 


Ce0 2 -Rh anodes for the direct electrochemical 
oxidation of waste vegetable oil, which led to stable 
current density of 500 mA cm -2 over 100 h under the 
condition of 0.5 V and 700°C. 

Alternative Fuels 

Fuel Flexibility 

SOFCs are the most flexible fuel cells with respect to the 
type of fuels [1, 246-255] which can be supplied 
directly to the anodes. This is partly due to their high 
operational temperatures suitable for internal 
reforming within the anodes. Because of their 
multi-fuel capability, not only hydrogen and carbon 
monoxide [256] but also various types of fuels such as 
hydrocarbons, alcohols, and biogas could be used via 
internal reforming and/or via simple external 
reforming [246-249, 257-260]. Practical fuels com¬ 
prise natural gas (consisting mainly of CH 4 with 
a small amount of other hydrocarbons such as C 2 H 6 ), 
coal gas (consisting mainly of CO and H 2 ), liquefied 
petroleum gas (LPG, consisting mainly of C 3 H 8 with 
C 4 H 10 ), naphtha (consisting mainly of C 5 and C 6 
hydrocarbons), gasoline (consisting mainly of hydro¬ 
carbons with carbon numbers around 8), kerosene 
(consisting mainly of hydrocarbons with carbon num¬ 
bers around 12), diesel fuel (consisting mainly of 
hydrocarbons with carbon numbers around 14-16), 
alcohols, dimethyl ether, and biogas (consisting mainly 
of CH 4 , C0 2 , and H 2 0) [251, 252]. 

In any case, one of the most important issues to be 
prevented in SOFC systems is carbon deposition (coke 
formation) from the fuels. Figure 21 shows the equi¬ 
librium products for (a) methane- and (b) methanol- 
based fuels with the steam-to-carbon (S/C) ratio of 1.5 
at elevated temperatures [251]. Assuming thermo¬ 
chemical equilibrium, carbon deposition is not 
expected to occur within a wide temperature range. 
The calculated results for various other fuels men¬ 
tioned above have been shown elsewhere [251]. The 
minimum amounts of H 2 0 (water vapor) necessary to 
prevent carbon deposition are shown in Fig. 22 for 
hydrocarbon fuels. While S/C of 1.5 is enough for 
CH 4 , higher S/C is needed with increasing carbon 
number of hydrocarbons, especially at lower 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 21 

Equilibrium products from (a) methane- and (b) methanol-based fuels with the steam-to-carbon ratio of 1.5 


temperatures. Such dependencies have also been 
revealed for 0 2 (partial oxidation) and C0 2 (C0 2 
reforming) [251] to prevent carbon deposition. 

It has been found that the major equilibrium con¬ 
stituents in fuel gases are H 2 (g), H 2 0(g), CO(g), 
C02(g)> CH 4 (g), and C(s) [251]. Since their equilib¬ 
rium compositions depend solely on the C-H-O ratio, 
we can plot, on C-H-O diagrams [252, 253, 255], 
parameters relevant to optimize operational condi¬ 
tions, including carbon deposition region, gas partial 
pressures, and electromotive force. Figure 23 shows the 
positions of various fuel species for fuel cells [252]. 


Their positions can be shifted by adding H 2 0 (steam 
reforming), 0 2 (partial oxidation), and/or C0 2 (C0 2 
reforming) as co-reactants. Figure 24 shows the carbon 
deposition limit lines, the carbon-rich side of which 
corresponds to the carbon deposition region at each 
temperature. From Figs. 23 and 24, we see that an 
addition of co-reactants is necessary to prevent carbon 
formation at SOFC operational temperatures from 
most fuels shown in Fig. 23. Once the C-H-O ratio is 
specified, C-H-O ternary diagrams are useful to derive 
relevant operational parameters without any additional 
thermochemical calculations [252]. 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 22 

Minimum steam-to-carbon (S/C) ratio needed to prevent carbon deposition in thermodynamic equilibrium for 
hydrocarbons [251] (Reproduced with permission from ECS - The Electrochemical Society) 



C 



Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 23 

Various hydrocarbons and related species in the C-H-0 diagram [252] (Reproduced with permission from ECS - The 
Electrochemical Society) 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 24 

Carbon deposition limit lines at various temperatures in the C-H-0 diagram [252] (Reproduced with permission from ECS 
- The Electrochemical Society) 


Apart from such thermochemical processes, also, 
kinetic phenomena may dominate electrode perfor¬ 
mance. In the following sections, various practical 
fuels are experimentally examined, the use of which 
may enable broader application of SOFCs. 

Practical Fuels 

City gas (town gas) is used as the state-of-the-art fuel 
for the first-generation commercial SOFC systems. 
City gas, consisting mainly of CH 4 from the natural 
gas, may be partially pre-reformed before supplying to 
the SOFC systems. Figure 25 shows the comparison 
of the current-voltage (I-V) characteristics between 
a H 2 -based fuel and a typical SOFC fuel (50% pre¬ 
reformed CH 4 with S/C = 2.5) [261]. The cross section 
of a typical SOFC used for this electrochemical mea¬ 
surement (see Fig. 25) is shown in Fig. 26. This simu¬ 
lated practical SOFC fuel for distributed cogeneration 


systems consists of the one-to-one mixture of fully 
reformed CH 4 with S/C = 2.5 and non-reformed CH 4 
with S/C = 2.5. While too low S/C may result in carbon 
deposition, excessive amount of H 2 0 causes lower 
OCV and may cause re-oxidation of Ni at high fuel 
utilization of the SOFC systems. 

As practical SOFC fuels may contain a small 
amount of sulfur as an impurity, H 2 S poisoning should 
be considered with respect to various operational con¬ 
ditions [262-266] . As an example, Fig. 27 shows the cell 
voltage, anodic polarization, and the ohmic loss on the 
anode side, measured for 50% pre-reformed CH 4 - 
based fuel with S/C = 2.5 containing 5 ppm H 2 S. An 
initial cell voltage drop followed by a quasi steady-state 
cell voltage can be observed associated mainly with an 
increase in anodic polarization and almost constant IR 
loss, up to 3,000 h [262]. Exactly speaking, cell voltage 
generally decreased up to ca. 200 h and then remained 
almost constant but with a slightly higher degradation 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 25 

Comparison of the cell voltage and anodic losses in using 3% humidified H 2 and 50% pre-reformed CH 4 (5/C = 2.5) as fuels 
at 800°C. Anodic loss includes anodic overpotential and anode-side ohmic loss [261] (Reproduced with permission from 
Dr. Kengo Haga) 


rate of 0.68%/1,000 h with 5 ppm H 2 S, compared to the 
degradation rate of 0.3%/l,000 h without H 2 S impurity 
[262]. This result clearly shows that SOFC is more 
tolerant against sulfur poisoning compared to other 
types of fuel cells, giving more flexibility in practical 
fuel selection. Even so, it should be noted that carbon 
deposition phenomena can be affected by the simulta¬ 
neous presence of sulfur impurities and non-reformed 
hydrocarbons [262]. 

Poisoning mechanisms of sulfur, a typical common 
fuel impurity, have been extensively analyzed for vari¬ 
ous fuels, including H 2 , H 2 -CO, CH 4 , partially 
reformed CH 4 , and simulated biogas [263-266]. For 
relatively low concentrations (ppm level) of sulfur, 
reversible processes associated with adsorption/ 
desorption of sulfur have been considered as the pre¬ 
dominant mechanism [263-265] . At higher sulfur con¬ 
centration and/or lower operational temperature, an 
irreversible degradation, associated with the oxidation 
of Ni due to a large anodic polarization, has been 
observed [263, 264]. Sulfur poisoning to internal 
reforming reactions is also serious, so that much larger 
cell voltage drop has been observed in the case where 
CH 4 -rich fuels are supplied [265]. In addition, the 


C-H-O-Ni-S stability diagram suggests that the 
formation of Ni 3 S 2 (melting point 787°C) could be 
possible for hydrogen-poor fuels. Possible sulfur 
poisoning mechanisms are summarized elsewhere 
[262, 265]. 

Poisoning phenomena have been analyzed for other 
impurities, including chlorine, siloxane, phosphorus, 
and boron on the anode side. Figure 28 shows FESEM 
micrographs of anodes poisoned by these impurities of 
ppm levels [266] . Chlorine poisoning is associated with 
the sublimation of NiCl 2 [267], while siloxane poison¬ 
ing leads to silica deposition [266]. Phosphorus is 
reactive with Ni to form a eutectic compound [268]. 
The presence of boron accelerates the grain growth of 
Ni in the anodes [266]. 

Various other types of fuels have been examined as 
possible SOFC fuels. Direct feeding of hydrocarbon 
fuels has been reported including gaseous fuels such 
as methane [246, 247, 269], biogas [270], ethane [246, 
247], and butane [246, 247], and liquid fuels such as 
n-decane [271], gasoline [271], synthetic diesel [272], 
crude and jet fuel oils [273], vegetable oil [248], and 
biodiesel [274]. Generally speaking, the S/C ratio 
should be optimized to prevent carbon deposition for 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 26 

FESEM micrographs of typical cathode and anode. Electrochemical performance of this cell is shown in Fig. 25. The 
composition of each electrode layer is also shown [261] (Reproduced with permission from Dr. Kengo Haga) 


internal steam reforming, and the oxygen-to-carbon 
(O/C) ratio should be adjusted for internal partial 
oxidation. Fuel impurities in such practical fuels 
should also be taken into account to maintain cell 
performance and durability. 

Alcohols 

The use of alcohols as SOFC fuels is of technological 
interest, as alcohols (bioliquids) can be derived from 
biomass and can be easily reformed. In addition, 
direct-alcohol SOFCs could be an alternative to 
direct-methanol fuel cells, typically using polymer elec¬ 
trolytes at a low temperature [4]. Power generation 
characteristics for alcohol-based fuels and for the sim¬ 
ulated reformed gas (consisting mainly of H 2 , CO, and 
H 2 0) are shown in Fig. 29 [257]. All these alcohols 
were premixed with water, so that the C-H-O ratio was 


fixed to the same value (C:H:0 = 9:38:10 for all fuels). 
It has been demonstrated that direct-alcohol SOFCs are 
feasible, at least, for alcohols with carbon number up to 
4 (butanol) [257, 258]. In case that methanol was 
directly supplied, the I-V characteristics were similar 
to those for the simulated reformed gas. However, with 
increasing carbon number of alcohols, a decrease in cell 
voltage was observed. From the gas analysis by gas 
chromatograph, it has been revealed that the composi¬ 
tions of the simulated reformed gas and the methanol- 
derived fuel gas were almost the same, well explaining 
the similar I-V characteristics [257]. However, with 
increasing carbon number of alcohols, a decrease in 
H 2 and CO concentrations associated with a decrease 
in cell voltage [257] was observed. Therefore, a higher 
catalytic activity to reform alcohols at fuel electrodes 
(or pre-reforming) is necessary to optimize the perfor¬ 
mance of such direct-alcohol SOFCs. 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 27 

Cell voltage, anodic polarization, and anode-side IR loss, measured at 200 mA/cm 2 during H 2 S (5 ppm) poisoning at 800°C 
for the 50% pre-reformed CH 4 fuel with 5/C = 2.5 



Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Figure 28 

Microstructural change of anodes by various impurities: chlorine, siloxane, phosphorus, and boron. Nanocrystalline Ni 
precipitates after chlorine poisoning, melted Ni particles after phosphorus poisoning, and grown Ni grains after boron 
poisoning can be distinguished, while deposited Si0 2 (dark grains) filling the porous Ni-cermet (bright grain) anode can be 
seen in the cross section of the anode after the siloxane poisoning [262, 266] (Reproduced with permission from Elsevier) 
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Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 29 

Current-voltage characteristics of SOFCs for alcohol-based 
fuels at 1,000°C [257] (Reproduced by permission of ECS- 
The Electrochemical Society) 


Biogas 

The use of biogas as a fuel for SOFCs is of both tech¬ 
nological and environmental interest [3, 4, 251, 260, 
270, 275-278]. Biogas can be produced from biomass 
via various procedures such as fermentation and ther¬ 
mal decomposition. Organic municipal wastes 
(biowastes) are all metabolized by bacteria under 
anaerobic conditions producing a mixed gas consisting 
of ca. 60% CH 4 and 40% C0 2 . Anaerobic fermentation 
which proceeds in a reducing atmosphere enables us to 
extract chemical energy of biowastes as CH 4 -rich bio¬ 
gas. Figure 30 shows the equilibrium products from 
biogas initially consisting of CH 4 and C0 2 . High-tem- 
perature equilibrium gas from the biogas mainly consists 
of H 2 and CO, so that a comparable electrochemical 
performance with hydrogen-based fuels [275, 277] is 
expected. This figure also indicates that carbon forma¬ 
tion is thermochemically expected, so that an addition 
of H 2 0 (water vapor) and/or 0 2 is needed. 

As practical examples, Fig. 31 shows the cell volt¬ 
ages of internal reforming SOFC running on biogas 
measured at 200 mA cm -2 [278]. Simulated biogas 
with the CH 4 /C0 2 ratio of 1.5 had led to stable cell 
voltage above 0.8 V for 800 h without pre-reformer and 
humidifier. The degradation rate of only 0.4%/1,000 h 
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Equilibrium products from biogas (CH 4 60% and C0 2 40%). 
The calculations are performed by assuming that 1 kmol 
carbon is supplied 
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Electrodes, Fuels). Figure 31 

Performance of direct-biogas SOFC: 1 month test of 
internal reforming SOFC running on simulated and 
desulfurized actual biogases at 800°C using anode- 
supported button cells 


has proven that the biogas-fueled SOFC can be oper¬ 
ated in the internal reforming mode. As for the actual 
biogas, a rather high voltage comparable to that 
obtained by simulated biogas was achieved for 
1 month. Monitoring of biogas composition simulta¬ 
neously with the cell voltage has showed that voltage 
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fluctuation (a maximum of 50 mV level) is related to 
the fluctuation of the CH 4 /C0 2 ratio [270] (Fig. 32). 
Abrupt decrease in CH 4 concentration (or abrupt 
increase in C0 2 concentration) induced temporarily 
lower cell voltage. The CH 4 /C0 2 ratio in the actual 
biogas fluctuated between 1.4 and 1.7, corresponding 
to a CH 4 concentration ranging between 58 and 
63 vol%. Biogas composition is influenced by many 
factors, for example, kinds of organic wastes, their 
physical states, and operational conditions of methane 
fermentation such as temperature and pH of the waste 
slurry. 

It should be noted that carbon deposition was more 
noticeable for the desulfurized actual biogas compared 
to simulated biogas. Judging from the result of the gas 
analysis, carbon deposition is probably triggered by 
traces of H 2 S in the desulfurized biogas which can 
cause deactivation of Ni catalyst for the dry reforming 
of methane [270] . Acceleration of carbon deposition in 
the presence of trace H 2 S was also reported for SOFCs 
operated with partially reformed CH 4 -based fuels 
[262]. Air addition to biogas is effective in preventing 
carbon deposition [277]. Solid oxide fuel cells could 
therefore become a core technology in a carbon-neutral 
zero-emission renewable energy systems, as biogas can 
be efficiently converted to electricity and heat, while 
exhaust C0 2 gas may be reused in photosynthesis of 
biomass using solar energy. 


Coal Gas 

As one of the most abundant and inexpensive fossil 
energy resources [279], coal can be used to produce 
coal gas, with CO and H 2 as main constituents. Coal gas 
could be applied as a fuel for a combined-cycle power 
plant with coal gasifier, SOFCs, and gas (and steam) 
turbine(s) [2, 256, 280-284]. The concentration of 
water vapor and carrier gas in such a gas depends on 
the gasification procedures for the partial oxidation of 
coal by air, oxygen, or water [280]. It is therefore of 
technological relevance to analyze power generation 
characteristics of SOFCs operated with mixed H 2 and 
CO fuel gases, with consideration that coal gas is CO- 
rich in general. 

Figure 33 shows the I-V characteristics of a SOFC at 
1,000°C for different H 2 -to-CO ratios [256]. The open- 
circuit voltage increased slightly with decreasing frac¬ 
tion of CO gas, expected from the oxygen partial pres¬ 
sures in thermochemical equilibrium. Figure 33 shows 
that the I-V characteristics improved, that is, the cell 
voltage increased for a given current density with 
decreasing CO concentration of the mixed gas. This 
change in the I-V characteristics became significant 
with increasing current density, but this result clearly 
indicates that the mixture of CO and H 2 , the major 
constituents of coal gas, can be applied as SOFC fuels, 
provided that fuel impurity effects can be avoided. 
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Voltage fluctuation in synchronization with the fluctuation of biogas composition during the long-term test (initial 200 h 
of Fig. 31 is shown) 
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It should be noted that raw coal gas contains various 
kinds of fuel impurities, so that poisoning effects should 
be carefully examined. Influence of typical fuel impuri¬ 
ties in coal gas on electrochemical performance and 



Fuel Cells (SOFC): Alternative Approaches (Electroytes, 
Electrodes, Fuels). Figure 33 

Typical l-V characteristics of SOFCs for different H 2 -to-CO 
ratios at 1,000°C [256] (Reproduced with permission from 
ECS - The Electrochemical Society) 


microstructural stability has been extensively studied 
and discussed in the literature [262-266, 268, 285-290] . 

Future Directions 

The first-generation SOFC systems just commercial¬ 
ized are constructed using the most common materials 
such as stabilized Zr0 2 electrolytes, La(Sr)Mn0 3 cath¬ 
odes, and Ni cermets, and applying the most conven¬ 
tional fuels such as city gas. However alternative 
materials and fuels could expand the future applicabil¬ 
ity of SOFCs. Table 3 briefly summarizes the state-of- 
the-art and alternative materials for electrolytes, cath¬ 
odes, and anodes, as well as possible fuels and applica¬ 
tions. Advanced materials may offer alternatives too, 
for example, high-temperature polymer electrolyte fuel 
cells, direct-alcohol fuel cells, and portable fuel cells, as 
well as fuel cells for power generation, cogeneration, 
and transportation. Advanced fuel electrode materials 
may realize flexible SOFCs applicable to wide varieties 
of fuels. Nanostructural design may considerably 
improve their electrochemical performance, for exam¬ 
ple, by tailoring nano-size and stress effects, by reduc¬ 
ing grain boundary resistivity, as well as by introducing 
fast ionic/mixed conducting pathways. 

While most of the alternative materials and fuels 
mentioned in this entry have not yet been tested in fuel 
cell stacks/systems or even as a single cell, they may 


Fuel Cells (SOFC): Alternative Approaches (Electroytes, Electrodes, Fuels). Table 3 State-of-the art technologies and 
typical alternatives of SOFC materials, fuels, and applications 



Electrolyte materials 

Cathode materials 

Anode materials 

Fuels 

Possible applications 

State-of-the-art 

technologies 

Zr0 2 (Y 2 0 3 ) (YSZ) 
Zr0 2 (Sc 2 0 3 ) (ScSZ) 
La(Sr)Ga(Mg)0 3 

La(Sr)MnO s 

La(Sr)Co(Fe)0 3 

Sm(Sr)Co0 3 

Ni/YSZ 

Ni/ScSZ 

Ni/Ce0 2 (Gd 2 0 3 ) 

City gas, LP 
gas, H 2 

Stational 
cogenerations. 
Distributed power 
units, Combined power 
plants (SOFC/turbine) 

Typical 

alternatives 

Ce0 2 (Gd 2 0 3 ) 

Gd 2 (Zr x Ti 1 _ x ) 2 0 7 

Ba 2 ln 2 0 5 

L a 9.33Sie026 

La9.33Ge 6 0 26 

BaCe(Y)0 3 

BaZr(Y)0 3 

Ba(Sr)Co(Fe)0 3 

Gd(Sr)Co0 3 

Sm(Sr)Co0 3 

La 2 (Sr)Ni0 4 

Nd 2 Ni0 4 

Ag-La(Sr)Mn0 3 

La(Sr)Cr(Mn)0 3 

La(Sr)Cr(Ru)0 3 

Sr(La)Ti0 3 

CuYSZ(Ce0 2 ) 

Kerosene, 

Gasoline, 

Diesel, 

Alcohol, 

Biogas/ 

biofuel. Coal 

gas, DME 

Automotive APU/power 
units, Stational 
applications with 
kerosene, Coal gas 
power generation. 
Portable applications. 
Direct-alcohol SOFCs, 
Carbon-neutral power 
generation using 
biofuels 
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have potential for wider varieties of applications in the 
future. Among others, the following possibilities should 
be emphasized to consider future directions: (1) High 
electrical efficiency beyond 50% is attractive to bring 
about distributed power generation [291], where high 
ionic conductivity of the electrolyte, low electrode 
polarization, low contact resistance, as well as sufficient 
durability are essential. (2) Low-temperature SOFCs 
may actualize mobile applications of SOFCs where 
much higher ionic conductance and electrode activities 
should be tailored [15, 292-297] and more sophisti¬ 
cated materials design at microscopic levels becomes 
important [9, 13, 298-300]. (3) As the operational 
temperature of the low-temperature SOFC approaches 
that of high-temperature PEFCs, we may also learn 
much from materials and electrochemical research for 
PEFCs [301-305], which may lead to an integration of 
materials research between SOFC and PEFC. (4) Fuel- 
flexible SOFCs may contribute to energy security by 
using a wide variety of fuels and carbon-neutral power 
generation by using regenerable energy resources, for 
which robustness and poisoning tolerance of electrodes 
may be critical [255, 260, 262]. SOFC researchers 
should take responsibilities to fulfill their potentials 
and possibilities via concept demonstrations of oper¬ 
ating materials, devices, and systems. 

Readers may read references 9, 21, 23, 24, 26, 28, 29, 
79, and 100 to understand fundamental materials sci¬ 
ence related to the topic of this chapter. In order to 
understand the very early stage of this research field, 
readers may check references 39, 40, and 126. 
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into electrical energy in a continuous way. As in a battery 
the reactants are prevented from chemically reacting by 
separating them with an electrolyte, which is in contact 
with electro-catalytically active porous electrode struc¬ 
tures. Apart from effectively separating the anode and 
cathode gases and/or liquids, in other words the fuel 
and air, the electrolyte mediates the electrochemical 
reactions taking place at the electrodes by conducting 
a specific ion at very high rates during the operation of 
the fuel cell. In the simplest case of a fuel cell, operat¬ 
ing with hydrogen (fuel) and oxygen (air) as reacting 
gases, a proton or oxide ion current equivalent to the 
electronic current passing through the external load is 
driven through the electrolyte and parts of the hetero¬ 
geneous electrode structures (Fig. 1). 

The driving force of this process is the Gibbs energy 
(AG r ) of the reaction: 

H 2 + 1/2 o 2 ^h 2 o 

and the potential difference (electromotive force: emf) 
forming across the cell simply is: 

emf = — AG r /zF 

where z is the number of electrons driven through the 
outer circuit and F is the Faraday constant. 


e" 



Fuel Cells, Introduction. Figure 1 

Schematic illustration of the electrochemical energy 
conversion process in a fuel cell 


The maximum possible electric energy E reversible , 
which can be harvested from such reactions in a fuel 
cell, is: 

Ereversible — AG r 

and, if the efficiency r\ is defined as the ratio 
of the electrical energy taken out of the fuel cell and 
the reaction enthalpy AH r (heat of reaction), the effi¬ 
ciency limit: 

R reversible = AG r /AH r = 1 — (TAS r /AH r ) 

can even be higher than unity, provided the reaction 
entropy AS r is positive (note that AH r is negative for an 
exothermal reaction) like for the oxidation of carbon- 
rich fuels, for example: 

C 3 H 8 T 50 2 —> 3C0 2 T 4H 2 0 

The first law of thermodynamics is not violated 
because a fuel cell is a nonadiabatic system exchanging 
heat with the environment; hence, the missing energy 
can be taken from the environment by cooling 
the same. 

Looking at fuel cells in such a principle way may 
lead one to the conclusion that they are perfect energy 
conversion systems superior to any heat engine, for 
which the efficiency is strictly limited to this of the 
Carnot cycle. 

However, the issue is more complex, and it is still an 
open question which role fuel cells will play in future 
more sustainable energy supply systems. 

Following up on above efficiency considerations, 
it must be mentioned that the thermodynamic effi¬ 
ciency limits of all relevant fuel cell reactions are actu¬ 
ally below unity (in the range 80-100%) and that 
irreversibilities (losses) occur in practically all parts 
of every fuel cell (e.g., transport losses in the gas diffu¬ 
sion electrodes, over-potentials associated with the 
electrochemical reactions taking place at the anode 
and cathode, IR losses across the electrolyte and all 
interfaces). 

Nonetheless, electric efficiencies of more than 50% 
have been demonstrated experimentally, and, consid¬ 
ering the fact that heat engines (e.g., steam turbines) 
reach such high efficiencies only for very large units, 
the large fuel efficiency of small fuel cells appears to be 
a unique feature (Fig. 2). 
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Fuel Cells, Introduction. Figure 2 

Efficiency versus power for different fuel cell types compared to other energy conversion technologies 


Together with capital cost and operating cost 
advantages, this is making fuel cells an attractive alter¬ 
native especially in the sub-10-MW range. 

Despite the very demanding conditions in automo¬ 
tive applications and the more than 100 years of opti¬ 
mization of the internal combustion engine, the 
relatively high fuel efficiency is one of the reasons why 
fuel cells have a real chance to be applied even in this 
highly competitive market. For a couple of niche mar¬ 
kets (like outdoor, yachting), fuel cells are already well 
established. 

Especially for mass applications such as automo¬ 
tive, a full life cycle assessment (LCA) is absolutely 
critical. Since there are a lot of embedded resources 
such as noble metal catalysts and energy for 
manufacturing fuel cells, LCA very much depends 
on aspects such as recyclability and lifetime. For the 
operation period itself, cost, performance, and dura¬ 
bility issues are important. But a complete picture must 
also consider aspects of specific applications including 
how they are connected in future energy scenarios. 

Since fuel cells are just energy conversion devices, 
also the availability of appropriate fuels is essential for 
the potential role of fuel cells. The most advanced fuel 
cell types rely on natural gas (methane) or hydrogen as 
a fuel. 


Due to the high H/C ratio (which gives it some 
environmental advantage) and availability, methane 
(CH 4 ) may play a key role in the broad introduction 
of fuel cell technology (► Direct Hydrocarbon Solid 
Oxide Fuel Cells). 

Hydrogen can be directly obtained by reforming 
natural gas, but today it is mainly produced as 
a by-product only, that is, hydrogen is not being used 
extensively as an energy carrier. Although, with hydro¬ 
gen, one can store large amounts of energy which can 
be distributed in relatively easy ways, the hydrogen 
storage problem is not fully solved yet. Also the ways 
of producing hydrogen, including the primary energy 
used for this, bear a lot of open questions. While some 
countries are even considering the use of excess nuclear 
energy (electricity and heat) for producing hydrogen, 
others want to rely completely on renewables such as 
wind and solar. 

I hope, these few considerations make it clear that 
the future of fuel cells may heavily depend on severe 
bifurcations. Heat engines and electrochemical energy 
storage devices (mainly batteries) existed side by side in 
the late nineteenth and the early twentieth century, but 
the abundance of fossil fuels has later driven the devel¬ 
opment of combustion engines and turbines retarding 
the development of alternative energy conversion 
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devices such as batteries and fuel cells. It could well be 
that the expected end of the “oil age” may also go along 
with an increasing importance of alternative energy 
conversion devices, among which fuel cells have the 
potential to play an important role. Much will depend 
on the societies driving this development. Their under¬ 
standing of “Sustainability” will be different depending 
on the relative importance of its different aspects, such 
as preservation of natural resources, environmental 
impact, safety and reliability. Moreover, the temptation 
to benefit from short-term “economical advantages” 
may strongly bias the decisions to come. 

Making large quantities of hydrogen with sunlight, 
directly or indirectly, is still far away, but there is no 
principal physicochemical law preventing this option, 
which seems to be one of the long-standing driving 
forces for the many fuel cell activities around the globe. 

The entries of the following part on fuel cells will 
strictly keep out of such aspects which are highly spec¬ 
ulative at this stage. They rather focus on the technical 
issues comprising descriptions of the state of the art 
and remaining problems, approaches to solve the same, 
and some speculations about future directions. This 
volume is meant to provide a solid ground for taking 
decisions. There is some redundancy and overlap 
between entries reflecting different points of view 
which naturally exist even for the technical issues. 

Historically, the fuel cell concept is known for more 
than 170 years. It was Christian Friedrich Schonbein 
who recognized and described the appearance of 
“inverse electrolysis” [1], shortly before Sir William 
Grove, the inventor of the platinum/zinc battery, 
constructed his first “gas voltaic battery” [2]. Grove 
already used platinum electrodes and dilute sulfuric 
acid as a proton conducting electrolyte, and it is 
worth mentioning that today’s low-temperature PEM 
(polymer electrolyte membrane) fuel cells still make 
use of related materials (carbon-supported platinum 
and perfluoro-sulfonic-acid (PFSA)-ionomers). The 
same is also true for high-temperature fuel cells which 
are dating back to the 1930s. In an attempt to prove 
the concept of “Brennstoff-Ketten mit Festleitern” 
(fuel chains with solid conductors), Emil Baur and 
Hans Preis tested different solid electrolytes [3], and 
they came to the conclusion that the “Nernst-Masse ist 
unubertroffen.” At that time “Nernst-Masse” had the 


composition 85% Zr0 2 , 15% Y 2 0 3 , which is still very 
close to what is used in current SOFCs (solid oxide fuel 
cell) as electrolyte material (► Solid Oxide Fuel Cell 
Materials: Durability, Reliability and Cost). Moreover, 
they tested “masses” containing ceria (Ce0 2 ), which is 
still discussed as a base for alternative electrolyte mate¬ 
rials (►Fuel Cells (SOFC): Alternative Approaches 
(Electroytes, Electrodes, Fuels)), and they even 
discussed power and cost issues of their system which 
actually used coal as a fuel (power densities of 10 kW/ 
m 3 at a cost of 1,000 Mark/kW were estimated). 
Although the numbers for both are much higher 
today, the factor of 4, estimated for the cost disadvan¬ 
tage compared to the cost of conventional technologies 
seemed to persist over time. 

Current R8cD work is still focusing on PEM fuels 
and SOFCs operating at low (sub-freezing to T ~ 90°C) 
and high temperature (T = 700 to above 1,000°C), but 
the intermediate temperature range (150-700°C) is 
progressively attracting more attention. After introduc¬ 
ing the most relevant fuel cell types (► Fuel Cell Types 
and Their Electrochemistry) the remaining entries 
of this volume are actually ordered by the operation 
temperature of the considered fuel cells (from high 
to low T) touching upon various aspects such as 
material science, engineering, economical issues, and 
the possible implications for more sustainable energy 
conversion scenarios. 

The high-temperature fuel cell part has a special 
entry on “Direct Hydrocarbon SOFCs” (►Direct 
Hydrocarbon Solid Oxide Fuel Cells) which are very 
interesting as a bridging technology, because they still 
run fossil fuel (natural gas), but may provide an envi¬ 
ronmental advantage through a higher efficiency. Since 
durability and reliability issues are critical for SOFCs 
they are discussed in a separate entry (► Solid Oxide 
Fuel Cell Materials: Durability, Reliability and Cost) 
before introducing alternative approaches with respect 
to the selection of materials and fuels (► Fuel Cells 
(SOFC): Alternative Approaches (Electroytes, Elec¬ 
trodes, Fuels)). The subsequent entry is related, addi¬ 
tionally discussing devices and configurations and the 
matter of in situ diagnostics (► Solid Oxide Fuel Cells). 
Finally, specific SOFC design concepts are described 
and sustainability aspects including full life cycle ana¬ 
lyses (FCA) are discussed (►Solid Oxide Fuel Cells, 
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Sustainability Aspects) before market issues are 
addressed (► Solid Oxide Fuel Cells, Marketing Issues). 

The section on intermediate temperature fuel 
cells has just one entry on each fuel cell type. 
With decreasing operation temperature, the “Molten 
Carbonate Fuel Cell” technology is critically discussed 
(►Molten Carbonate Fuel Cells) before two related 
systems relying on the unique proton conducting prop¬ 
erties of phosphoric acid are described. While the well- 
established phosphoric acid fuel cell (PAFC) is devel¬ 
oped for stationary applications (► Phosphoric Acid 
Fuel Cells for Stationary Applications), polybenzi¬ 
midazole (used as a matrix for phosphoric acid) 
fuel cells even have some potential for mobile and 
small applications (► Polybenzimidazole Fuel Cell 
Technology). 

The subsequent part touches upon the various 
aspects of PEM fuel cells which have seen tremendous 
progress over the last two decades. But there are still 
several critical materials issues which are related to the 
cost and durability requirements especially for auto¬ 
motive applications and which are identified in the first 
entry (► PEM Fuel Cells, Materials and Design Devel¬ 
opment Challenges). The following entries are 
discussing specific components which are important, 
and in most cases also critical, for the function of the 
PEM fuel cell system. There are three entries on mem¬ 
branes which may be considered to be the “heart” of 
any PEM fuel cell. In most cases, this is still a PFSA 
(perfluoro-sulfonic-acid) membrane, but more 
recently also hydrocarbon membranes have been 
used, not only because of cost advantages but also 
because of their distinctly different properties (e.g., 
lower gas permeation, lower electro-osmotic drag, 
higher morphological stability). The various aspects 
of this move are described (► Membrane Electrolytes, 
from Perfluoro Sulfonic Acid (PFSA) to Hydrocarbon 
Ionomers) before the issue of operation at high tem¬ 
perature and low humidity is addressed (► Proton 
Exchange Membrane Fuel Cells: High-Temperature, 
Low-Humidity Operation). The latter may help to 
reduce the system cost especially by reducing the size 
of the cooling system, the amount of noble-metal cat¬ 
alyst, and the purity requirements for the fuel used 
(hydrogen). Cost reduction by using non-noble metal 
catalysts is the motivation for the development of 


better anion exchange membranes which is described 
in the following entry (►Alkaline Membrane Fuel 
Cells). The part on electro catalysts and electrode struc¬ 
tures contains an entry on noble metal catalysts 
(►PEM Fuel Cells and Platinum-Based 
Electro catalysts), including current attempts to 
improve both activity and durability, while the follow¬ 
ing entry presents a radically different approach relying 
on the activity of some non-noble metals for the oxy¬ 
gen reduction reaction (► Polymer Electrolyte Mem¬ 
brane Fuel Cells (PEM-FC) and Non-noble Metal 
Catalysts for Oxygen Reduction) which may also help 
to bring cost down. Since PEM fuel cell systems are 
generally more complex than those of other fuel cell 
types, this section commences with an entry spanning 
the bridge from a few fundamentals to the complete fuel 
cell system including cost and reliability considerations 
(►Polymer Electrolyte (PE) Fuel Cell Systems). The 
implementation of PEM fuel cells into cars is described 
in an entry on automotive applications (► Polymer 
Electrolyte Membrane (PEM) Fuel Cells, Automotive 
Applications). The last entry of this part has some 
overlap with the first discussing cost, performance, 
and durability issues of existing systems (► PEM Fuel 
Cell Materials: Costs, Performance and Durability). 

The present volume is completed by a critical com¬ 
parison of fuel cell technologies to other technologies 
beyond a one-to-one comparison (► Fuel Cell Com¬ 
parison to Alternate Technologies). 

Energy conversion by fuel cells has the genuine 
advantage of high efficiency in a large power range 
(Fig. 2) which is giving them a high flexibility as part 
of complex energy conversion and distribution scenar¬ 
ios. The synergetic combination with other technolo¬ 
gies (e.g., fuel cells serving as range extenders in electric 
vehicles) could even help to overcome remaining crit¬ 
ical issues such as cost. There has been tremendous 
progress with respect to cost reduction and increase 
in power and lifetime which has taken fuel cells into 
some niche markets (e.g., for military and remote 
applications), but their widespread use still requires 
further fundamental research and engineering efforts. 
This volume provides detailed insight into the current 
status of the diverse fuel cell technologies and identifies 
future directions based on critical analyses of the state 
of the art. 
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